OM protein - protein search, using sw model

Run on:         January 9, 2004, 12:21:36 ; Search time 39.6051 Seconds
                (without alignments)
                1795.466 Million cell updates/sec

Title:          US-09-674-379A-13
Perfect score:  2533
Sequence:       1 MPGIKRILTVTILALCLPSP INFRGSSVIRLRIYVSQYPF 448

Scoring table:  BLOSUM62
                Gapop 10.0 , Gapext 0.5

Searched:       1107863 seqs, 158726573 residues

Total number of hits satisfying chosen parameters:  1107863

Minimum DB seq length: 0
Maximum DB seq length: 2000000000

Post-processing: Minimum Match 0%
                 Maximum Match 100%
                 Listing first 45 summaries



Database :      A_Geneseq_19Jun03:*
                1:  /SIDS1/gcgdata/geneseq/geneseqp-embl/AA1980.DAT:*
                2:  /SIDS1/gcgdata/geneseq/geneseqp-embl/AA1981.DAT:*
                3:  /SIDS1/gcgdata/geneseq/geneseqp-embl/AA1982.DAT:*
                4:  /SIDS1/gcgdata/geneseq/geneseqp-embl/AA1983.DAT:*
                5:  /SIDS1/gcgdata/geneseq/geneseqp-embl/AA1984.DAT:*
                6:  /SIDS1/gcgdata/geneseq/geneseqp-embl/AA1985.DAT:*
                7:  /SIDS1/gcgdata/geneseq/geneseqp-embl/AA1986.DAT:*
                8:  /SIDS1/gcgdata/geneseq/geneseqp-embl/AA1987.DAT:*
                9:  /SIDS1/gcgdata/geneseq/geneseqp-embl/AA1988.DAT:*
                10:  /SIDS1/gcgdata/geneseq/geneseqp-embl/AA1989.DAT
                11:  /SIDS1/gcgdata/geneseq/geneseqp-embl/AA1990.DAT
                12:  /SIDS1/gcgdata/geneseq/geneseqp-embl/AA1991.DAT
                13:  /SIDS1/gcgdata/geneseq/geneseqp-embl/AA1992.DAT
                14:  /SIDS1/gcgdata/geneseq/geneseqp-embl/AA1993.DAT
                15:  /SIDS1/gcgdata/geneseq/geneseqp-embl/AA1994.DAT
                16:  /SIDS1/gcgdata/geneseq/geneseqp-embl/AA1995.DAT
                17:  /SIDS1/gcgdata/geneseq/geneseqp-embl/AA1996.DAT
                18:  /SIDS1/gcgdata/geneseq/geneseqp-embl/AA1997.DAT
                19:  /SIDS1/gcgdata/geneseq/geneseqp-embl/AA1998.DAT
                20:  /SIDS1/gcgdata/geneseq/geneseqp-embl/AA1999.DAT
                21:  /SIDS1/gcgdata/geneseq/geneseqp-embl/AA2000.DAT
                22:  /SIDS1/gcgdata/geneseq/geneseqp-embl/AA2001.DAT
                23:  /SIDS1/gcgdata/geneseq/geneseqp-embl/AA2002.DAT
                24:  /SIDS1/gcgdata/geneseq/geneseqp-embl/AA2003.DAT



Pred. No. is the number of results predicted by chance to have a
score greater than or equal to the score of the result being printed,
and is derived by analysis of the total score distribution.

Result            Query
   No.    Score   Match  Length  DB  ID         Description
----------------------------------------------------------------
     -        -       -       -   -  -          Novel human secret
     -        -       -       -   -  -          Human secreted/tra
     -        -       -       -   -  -          Human secreted/tra
     -        -     99.       -   -  -          -
     -        -     99.       -   -  -          Human PRO polypept
     -        -       -       -   -  ABU56139   Human secreted/tra
     -     2527     99.       -  24  -          Human PRO polypept
     -        -     99.       -  24  ABU10713   Human secreted/tra
    23     2405    94.9     448  21  AAY56750   Smooth muscle prol
    24     2405      .9       -  21  AAY54990   Full length mouse
    25        -    93.8       -  21  AAY56752   Smooth muscle prol
    26     2376    93.8       -  21  AAY54991   Full length mouse
    27     2302     90.       -   -  AAY56751   Smooth muscle prol
    28        -     90.       -  21  AAY56753   Smooth muscle prol
    29     2230       -       -   -  AAW31705   Human extracellula
    30        -      .1     335  21  -          -
     -        -       -       -  22  -          Skin cell protein,
     -        -       -       -   -  -          Rat protein isolat
     -        -       -       -   -  -          Human PRO226 prote
     -        -       -       -  21  -          A human p53 mutant
     -        -      .9       -   -  AAY55850   -
     -        -       -       -  22  AAU12330   Human PRO226 polyp
     -        -    50.9     443  23  AAU86130   Human PRO226 polyp
     -     1289    50.9     443  24  ABU66728   Human PRO polypept
    41     1289    50.9     443  24  ABU67004   Human secreted/tra
    42     1289    50.9     443  24  ABU59809   Novel secreted and
    43        -    50.7     443  22  -          -
    44        -      .6       -  21  AAY84706   Amino acid sequenc
    45   1237.5    48.9     433  21  AAB58353   Lung cancer associ
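
The report does not define the Query Match column. The surviving summary values (for example score 2405 beside 94.9, and 1237.5 beside 48.9) are consistent with the hit score expressed as a percentage of the perfect score of 2533. The sketch below works under that assumption only; the function name `query_match_percent` is illustrative and is not part of the search software.

```python
def query_match_percent(score: float, perfect_score: float) -> float:
    """Return a hit score as a percentage of the perfect (self-match) score.

    Assumption: Query Match = 100 * score / perfect score, rounded to one
    decimal place. The report itself does not state this formula.
    """
    if perfect_score <= 0:
        raise ValueError("perfect_score must be positive")
    return round(100.0 * score / perfect_score, 1)


PERFECT_SCORE = 2533  # "Perfect score" from the report header

# Scores taken from the summary list and the alignment headers.
for score in (2533, 2405, 1289, 1237.5):
    print(score, query_match_percent(score, PERFECT_SCORE))
```

Under this assumption a score equal to the perfect score gives 100.0, which matches the Query Match reported for the identical 448-residue hits.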



ALIGNMENTS 



RESULT 1 




AAW79739
ID AAW79739 standard; Protein; 448 AA.
XX
AC AAW79739;
XX
DT 25-JAN-1999 (first entry)
XX
DE Human EEGF protein.
XX
KW Extracellular/epidermal growth factor-like protein; EEGF; human; liver;
KW vascular smooth muscle cell proliferation; neurology; pathology; AIDS;
KW dementia; ocular; disorder; cornea; inflammation; tumour cell; kidney;
KW wound; epithelium; surgery; skin graft; ulcer; diabetic; trophic;
KW Marfan syndrome; treatment; hair follicle; embryogenesis; neoplasma;
KW epidermal cell; cancer; psoriasis; detection.
XX
OS Homo sapiens.
XX
FH   Key             Location/Qualifiers
FT   Peptide         1..25
FT                   /label= signal
FT   Protein         26..448
FT                   /label= EEGF
FT                   /note= "Extracellular/epidermal growth factor-like
FT                   protein"
FT   Region          112..153
FT                   /label= EGF-1
FT   Region          154..190
FT                   /label= EGF-2
FT   Region          191..230
FT                   /label= EGF-3
FT   Region          231..271
FT                   /label= EGF-4
FT   Region          272..314
FT                   /label= EGF-5
XX
PN WO9846746-A1.
XX
PD 22-OCT-1998.
XX
PF 11-APR-1997; 97WO-US06020.
XX
PR 11-APR-1997; 97WO-US06020.
XX
PA (HUMA-) HUMAN GENOME SCI INC.
XX
PI Li H, Olsen HS;
XX
DR WPI; 1998-568728/48.
DR N-PSDB; AAV62432.
XX
PT New isolated extracellular/epidermal growth factor - used for
PT regulating vascular smooth muscle cell proliferation, e.g. for
PT enhancing neurological functions or treating neoplasia and other
PT disorders.
XX
PS Claim 10a; Fig 1A-D; 62pp; English. 
XX 

CC This sequence represents a novel human extracellular/epidermal growth
CC factor-like protein, EEGF. This protein can be used to regulate
CC vascular smooth muscle cell proliferation and for restoration or
CC enhancement of neurological functions diminished as a result or other
CC damaging pathologies such as AIDS dementia. The protein can also be used
CC to treat senile dementia, ocular disorders such as corneal inflammation,
CC for targeting tumour cells, for treating kidney disorders, for liver
CC regeneration or treating liver dysfunction, for treating wounds including
CC all cutaneous wounds, corneal wounds, and injuries to the
CC epithelial-lined hollow organs of the body or resulting from trauma such
CC as burns, abrasions and cuts as well as from surgical procedures such as
CC surgical incisions and skin grafting. The polypeptides can also be used
CC for treating chronic conditions, such as chronic ulcers, diabetic ulcers,
CC other non-healing (trophic) conditions, to treat Marfan syndrome, to
CC promote hair follicular development, to stimulate growth and
CC differentiation of various epidermal and epithelial cells in vivo and in
CC vitro and to stimulate embryogenesis. Antagonists to EEGF can be used to
CC treat neoplasia such as cancers or tumours, skin disorders such as
CC psoriasis or corneal inflammation. The products can also be used for
CC identifying EEGF receptors, detection, diagnosis and drug screening.
XX 

SQ Sequence 448 AA;

Query Match 100.0%; Score 2533; DB 19; Length 448;
Best Local Similarity 100.0%; Pred. No. 4e-159;
Matches 448; Conservative 0; Mismatches 0; Indels 0; Gaps 0;

Qy          1 MPGIKRILTVTILALCLPSPGNAQAQCTNGFDLDRQSGQCLDIDECRTIPEACRGDMMCV 60
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db          1 MPGIKRILTVTILALCLPSPGNAQAQCTNGFDLDRQSGQCLDIDECRTIPEACRGDMMCV 60

Qy         61 NQNGGYLCIPRTNPVYRGPYSNPYSTPYSGPYPAAAPPLSAPNYPTISRPLICRFGYQMD 120
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db         61 NQNGGYLCIPRTNPVYRGPYSNPYSTPYSGPYPAAAPPLSAPNYPTISRPLICRFGYQMD 120

Qy        121 ESNQCVDVDECATDSHQCNPTQICINTEGGYTCSCTDGYWLLEGQCLDIDECRYGYCQQL 180
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db        121 ESNQCVDVDECATDSHQCNPTQICINTEGGYTCSCTDGYWLLEGQCLDIDECRYGYCQQL 180

Qy        181 CANVPGSYSCTCNPGFTLNEDGRSCQDVNECATENPCVQTCVNTYGSFICRCDPGYELEE 240
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db        181 CANVPGSYSCTCNPGFTLNEDGRSCQDVNECATENPCVQTCVNTYGSFICRCDPGYELEE 240

Qy        241 DGVHCSDMDECSFSEFLCQHECVNQPGTYFCSCPPGYILLDDNRSCQDINECEHRNHTCN 300
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db        241 DGVHCSDMDECSFSEFLCQHECVNQPGTYFCSCPPGYILLDDNRSCQDINECEHRNHTCN 300

Qy        301 LQQTCYNLQGGFKCIDPIRCEEPYLRISDNRCMCPAENPGCRDQPFTILYRDMDVVSGRS 360
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db        301 LQQTCYNLQGGFKCIDPIRCEEPYLRISDNRCMCPAENPGCRDQPFTILYRDMDVVSGRS 360

Qy        361 VPADIFQMQATTRYPGAYYIFQIKSGNEGREFYMRQTGPISATLVMTRPIKGPREIQLDL 420
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db        361 VPADIFQMQATTRYPGAYYIFQIKSGNEGREFYMRQTGPISATLVMTRPIKGPREIQLDL 420

Qy        421 EMITVNTVINFRGSSVIRLRIYVSQYPF 448
              ||||||||||||||||||||||||||||
Db        421 EMITVNTVINFRGSSVIRLRIYVSQYPF 448
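
The Matches, Mismatches and Indels figures printed above each alignment can be recounted from the Qy and Db rows. The following is a minimal sketch, assuming gaps are written as `-` and that each gap position counts as one indel; the report does not spell out its own counting rules, and the Conservative count is omitted because it would need the full BLOSUM62 table. The function name `alignment_stats` is hypothetical.

```python
def alignment_stats(query: str, subject: str, gap: str = "-") -> dict:
    """Count matches, mismatches and gap positions over two aligned rows."""
    if len(query) != len(subject):
        raise ValueError("aligned rows must have equal length")
    matches = mismatches = indels = 0
    for q, s in zip(query, subject):
        if q == gap or s == gap:
            indels += 1          # assumption: one indel per gap position
        elif q == s:
            matches += 1
        else:
            mismatches += 1
    identity = 100.0 * matches / len(query) if query else 0.0
    return {
        "matches": matches,
        "mismatches": mismatches,
        "indels": indels,
        "identity_pct": round(identity, 1),
    }


# Final alignment block of RESULT 1 (residues 421 to 448).
qy = "EMITVNTVINFRGSSVIRLRIYVSQYPF"
db = "EMITVNTVINFRGSSVIRLRIYVSQYPF"
stats = alignment_stats(qy, db)
print(stats)
```

Summing the per-block counts over all eight blocks of an alignment should reproduce the totals in its header, here 448 matches and no mismatches or indels.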



RESULT 2 
AAW95709 

ID AAW95709 standard; Protein; 448 AA. 
XX 

AC AAW95709; 
XX 

DT 21-JUN-1999 (first entry) 
XX 

DE Homo sapiens fetal kidney clone AK647 secreted protein. 
XX 

KW Secreted protein; fetal kidney. 
XX 

OS Homo sapiens. 
XX 

PN WO9900405-A1. 
XX 

PD 07-JAN-1999. 
XX 

PF 29-JUN-1998; 98WO-US13530.
XX

PR 30-JUN-1997; 97US-0885610.
XX 

PA (GEMY ) GENETICS INST INC. 
XX 

PI Agostino MJ, Evans C, Jacobs K, Lavallie ER, Mccoy JM;

PI Merberg D, Racie LA, Treacy M; 

XX 

DR WPI; 1999-095671/08. 

DR N-PSDB; AAXG7567.
XX 

PT New polynucleotides encoding secreted human proteins - are derived 

PT from foetal kidney or adult retina cDNA libraries, used as, e.g. 

PT potential vaccines 
XX 

PS Claim 11; Pages 52-54; 76pp; English. 
XX 

CC The sequence is that of a secreted protein from a human fetal 

CC kidney clone AK296. Such a sequence is predicted to have biological 

CC activities which would make them suitable for treating, preventing or 

CC ameliorating medical conditions in humans and animals, although no 

CC supporting data is given. Suggested activities include nutritional 

CC activity, cytokine and cell proliferation/differentiation activity, 

CC immune stimulating (e.g. as vaccines) or suppressing activity, 

CC haematopoiesis regulating activity, tissue growth activity, 

CC activin/inhibin activity, chemotactic/chemokinetic activity, haemostatic 

CC and thrombolytic activity, receptor/ligand activity, anti-inflammatory

CC activity, cadherin/tumour invasion suppressor activity, and tumour 

CC inhibition activity. It is also stated to be useful for gene 

CC therapy. 

XX 

SQ Sequence 448 AA; 



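Query Match 100.0%; Score 2533; DB 20; Length 448;
Best Local Similarity 100.0%; Pred. No. 4e-159;
Matches 448; Conservative 0; Mismatches 0; Indels 0; Gaps 0;

Qy          1 MPGIKRILTVTILALCLPSPGNAQAQCTNGFDLDRQSGQCLDIDECRTIPEACRGDMMCV 60
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db          1 MPGIKRILTVTILALCLPSPGNAQAQCTNGFDLDRQSGQCLDIDECRTIPEACRGDMMCV 60

Qy         61 NQNGGYLCIPRTNPVYRGPYSNPYSTPYSGPYPAAAPPLSAPNYPTISRPLICRFGYQMD 120
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db         61 NQNGGYLCIPRTNPVYRGPYSNPYSTPYSGPYPAAAPPLSAPNYPTISRPLICRFGYQMD 120

Qy        121 ESNQCVDVDECATDSHQCNPTQICINTEGGYTCSCTDGYWLLEGQCLDIDECRYGYCQQL 180
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db        121 ESNQCVDVDECATDSHQCNPTQICINTEGGYTCSCTDGYWLLEGQCLDIDECRYGYCQQL 180

Qy        181 CANVPGSYSCTCNPGFTLNEDGRSCQDVNECATENPCVQTCVNTYGSFICRCDPGYELEE 240
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db        181 CANVPGSYSCTCNPGFTLNEDGRSCQDVNECATENPCVQTCVNTYGSFICRCDPGYELEE 240

Qy        241 DGVHCSDMDECSFSEFLCQHECVNQPGTYFCSCPPGYILLDDNRSCQDINECEHRNHTCN 300
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db        241 DGVHCSDMDECSFSEFLCQHECVNQPGTYFCSCPPGYILLDDNRSCQDINECEHRNHTCN 300

Qy        301 LQQTCYNLQGGFKCIDPIRCEEPYLRISDNRCMCPAENPGCRDQPFTILYRDMDVVSGRS 360
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db        301 LQQTCYNLQGGFKCIDPIRCEEPYLRISDNRCMCPAENPGCRDQPFTILYRDMDVVSGRS 360

Qy        361 VPADIFQMQATTRYPGAYYIFQIKSGNEGREFYMRQTGPISATLVMTRPIKGPREIQLDL 420
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db        361 VPADIFQMQATTRYPGAYYIFQIKSGNEGREFYMRQTGPISATLVMTRPIKGPREIQLDL 420

Qy        421 EMITVNTVINFRGSSVIRLRIYVSQYPF 448
              ||||||||||||||||||||||||||||
Db        421 EMITVNTVINFRGSSVIRLRIYVSQYPF 448
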



RESULT 3 
AAW94281 

ID AAW94281 standard; Protein; 448 AA.
XX
AC AAW94281;
XX
DT 07-MAY-1999 (first entry)
XX
DE Human extracellular matrix protein (ECMP)-1.
XX
KW Extracellular matrix protein; ECMP-1; diagnosis; treatment; cancer;
KW immune disorder; human.
XX
OS Homo sapiens.
XX
PN WO9900410-A2.
XX
PD 07-JAN-1999.
XX
PF 23-JUN-1998; 98WO-US13012.
XX
PR 27-JUN-1997; 97US-0884072.
XX
PA (INCY-) INCYTE PHARM INC
XX
PI Bandman O, Corley NC, Guegler KJ;
XX
DR WPI; 1999-095674/08.
DR N-PSDB; AAX05359.
XX
PT New polynucleotide encoding extracellular matrix protein, ECMP-1 -
PT useful in the diagnosis, prevention and treatment of immune
PT disorders and cancer
XX
PS Claim 1; Fig 1A-G; 79pp; English.
XX
CC This represents a human extracellular matrix protein (ECMP)-1. Host
CC cells containing a vector comprising the ECMP-1 nucleic acid are used
CC for the recombinant production of the protein. ECMP-1 and its
CC (ant)agonists are useful in the diagnosis, prevention, and treatment
CC of cancer and immune disorders.

XX 

SQ Sequence 448 AA; 

Query Match 100.0%; Score 2533; DB 20; Length 448;
Best Local Similarity 100.0%; Pred. No. 4e-159;
Matches 448; Conservative 0; Mismatches 0; Indels 0; Gaps 0;

Qy          1 MPGIKRILTVTILALCLPSPGNAQAQCTNGFDLDRQSGQCLDIDECRTIPEACRGDMMCV 60
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db          1 MPGIKRILTVTILALCLPSPGNAQAQCTNGFDLDRQSGQCLDIDECRTIPEACRGDMMCV 60

Qy         61 NQNGGYLCIPRTNPVYRGPYSNPYSTPYSGPYPAAAPPLSAPNYPTISRPLICRFGYQMD 120
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db         61 NQNGGYLCIPRTNPVYRGPYSNPYSTPYSGPYPAAAPPLSAPNYPTISRPLICRFGYQMD 120

Qy        121 ESNQCVDVDECATDSHQCNPTQICINTEGGYTCSCTDGYWLLEGQCLDIDECRYGYCQQL 180
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db        121 ESNQCVDVDECATDSHQCNPTQICINTEGGYTCSCTDGYWLLEGQCLDIDECRYGYCQQL 180

Qy        181 CANVPGSYSCTCNPGFTLNEDGRSCQDVNECATENPCVQTCVNTYGSFICRCDPGYELEE 240
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db        181 CANVPGSYSCTCNPGFTLNEDGRSCQDVNECATENPCVQTCVNTYGSFICRCDPGYELEE 240

Qy        241 DGVHCSDMDECSFSEFLCQHECVNQPGTYFCSCPPGYILLDDNRSCQDINECEHRNHTCN 300
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db        241 DGVHCSDMDECSFSEFLCQHECVNQPGTYFCSCPPGYILLDDNRSCQDINECEHRNHTCN 300

Qy        301 LQQTCYNLQGGFKCIDPIRCEEPYLRISDNRCMCPAENPGCRDQPFTILYRDMDVVSGRS 360
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db        301 LQQTCYNLQGGFKCIDPIRCEEPYLRISDNRCMCPAENPGCRDQPFTILYRDMDVVSGRS 360

Qy        361 VPADIFQMQATTRYPGAYYIFQIKSGNEGREFYMRQTGPISATLVMTRPIKGPREIQLDL 420
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db        361 VPADIFQMQATTRYPGAYYIFQIKSGNEGREFYMRQTGPISATLVMTRPIKGPREIQLDL 420

Qy        421 EMITVNTVINFRGSSVIRLRIYVSQYPF 448
              ||||||||||||||||||||||||||||
Db        421 EMITVNTVINFRGSSVIRLRIYVSQYPF 448
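
Each database record in this report is a flat file in which every line starts with a two-letter code (ID, AC, DT, DE, KW, PT, CC and so on) and XX lines act as separators. A minimal sketch of grouping a record's lines by code follows; it assumes one space after the code, as in this listing, and the helper name `parse_record` is hypothetical rather than part of any Geneseq tool.

```python
from collections import defaultdict


def parse_record(lines):
    """Group flat-file lines by their leading two-letter code.

    XX separator lines and blank lines are skipped. Values keep their
    original order so multi-line fields such as PT can be rejoined.
    """
    fields = defaultdict(list)
    for raw in lines:
        line = raw.strip()
        if not line:
            continue
        code, _, value = line.partition(" ")
        if code == "XX":
            continue
        fields[code].append(value.strip())
    return dict(fields)


# A few lines taken from RESULT 3 above.
sample = [
    "ID AAW94281 standard; Protein; 448 AA.",
    "XX",
    "AC AAW94281;",
    "XX",
    "DT 07-MAY-1999 (first entry)",
    "XX",
    "PT New polynucleotide encoding extracellular matrix protein, ECMP-1 -",
    "PT useful in the diagnosis, prevention and treatment of immune",
    "PT disorders and cancer",
]
record = parse_record(sample)
print(record["AC"], len(record["PT"]))
```

Joining the values of a repeated code with a space, for example `" ".join(record["PT"])`, recovers the full title text.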



RESULT 4 
AAY57058 

ID AAY57058 standard; Protein; 448 AA. 
XX 

AC AAY57058; 
XX 

DT 21-FEB-2000 (first entry) 
XX 

DE Amino acid sequence of the human secreted protein AK647. 
XX 

KW AK647; aortic tissue development; smooth muscle cell modulator; SCID; 

KW nutritional supplement; vasculogenesis; embryonic development; infection; 

KW cytokine activity; cell proliferation; cell differentiation; detect; HIV; 

KW immune deficiency; haematopoiesis regulation; tissue regrowth; diagnose; 

KW wound healing; restenosis; atherosclerosis; drug screen. 

XX 

OS Homo sapiens. 
XX 

PN WO9960125-A2. 
XX 

PD 25-NOV-1999.
XX
PF 18-MAY-1999; 99WO-US10931.
XX
PR 19-MAY-1998; 98US-0081002.
PR 21-MAY-1998; 98US-0083002.
XX
PA (GEMY ) GENETICS INST INC.
XX
PI Jacobs K, McCoy JM, Racie L, LaVallie E, Treacy M, Evans C;
PI Agostino M, Lu Z, Merberg D;
XX
DR WPI; 2000-053298/04.
DR N-PSDB; AAZ39892.
XX
PT Proteins, and their encoding polynucleotides, used for treating e.g.
PT smooth muscle cell growth, vasculogenesis, restenosis or
PT atherosclerosis -
XX
PS Claim 4; Page 46-47; 49pp; English.
XX 

CC This is the amino acid sequence of the human secreted protein AK647. The
CC polynucleotide sequence was obtained from a human foetal kidney cDNA
CC library. AK647 homologues in chicks and rodents are involved in aortic
CC tissue development. The spatial and temporal distribution of AK647
CC indicated that it acts as a modulator of smooth muscle cells in
CC vasculogenesis during embryonic development. The primary structure of
CC AK647 consists of multiple EGF domains. The AK647 protein can be used as
CC a nutritional source or supplement. The protein shows both inhibitory and
CC inducing, cytokine, cell proliferation and cell differentiation activity.
CC The protein may also be used in the treatment of immune deficiencies and
CC disorders, including severe combined immunodeficiency (SCID), HIV and
CC other viral, bacterial and fungal infections. Regulation of immune
CC responses may also be carried out by the AK647 protein. Other uses of the
CC protein include a role in the regulation of haematopoiesis and in the
CC treatment of myeloid and lymphoid cell deficiencies. Uses in bone,
CC cartilage, tendon, ligament and nerve tissue regrowth are also possible,
CC as well as for wound healing and in the treatment of ulcers and burns.
CC The polynucleotides and proteins can be used for preventing, treating or
CC ameliorating smooth muscle cell growth, vasculogenesis, restenosis,
CC atherosclerosis, blood vessel remodelling and degeneration. The proteins
CC may also have activin/inhibin, chemotactic/chemokinetic, haemostatic and
CC thrombolytic, receptor/ligand, anti-inflammatory, cadherin/tumour
CC invasion suppressor, and tumour inhibition activity. AK647 specific
CC antibodies can be used for promoting smooth muscle cell growth or
CC vasculogenesis. The proteins and polynucleotides can also be used for
CC detection, diagnosis and drug screening.

XX 

SQ Sequence 448 AA; 

Query Match 100.0%; Score 2533; DB 21; Length 448;
Best Local Similarity 100.0%; Pred. No. 4e-159;
Matches 448; Conservative 0; Mismatches 0; Indels 0; Gaps 0;

Qy          1 MPGIKRILTVTILALCLPSPGNAQAQCTNGFDLDRQSGQCLDIDECRTIPEACRGDMMCV 60
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db          1 MPGIKRILTVTILALCLPSPGNAQAQCTNGFDLDRQSGQCLDIDECRTIPEACRGDMMCV 60

Qy         61 NQNGGYLCIPRTNPVYRGPYSNPYSTPYSGPYPAAAPPLSAPNYPTISRPLICRFGYQMD 120
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db         61 NQNGGYLCIPRTNPVYRGPYSNPYSTPYSGPYPAAAPPLSAPNYPTISRPLICRFGYQMD 120

Qy        121 ESNQCVDVDECATDSHQCNPTQICINTEGGYTCSCTDGYWLLEGQCLDIDECRYGYCQQL 180
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db        121 ESNQCVDVDECATDSHQCNPTQICINTEGGYTCSCTDGYWLLEGQCLDIDECRYGYCQQL 180

Qy        181 CANVPGSYSCTCNPGFTLNEDGRSCQDVNECATENPCVQTCVNTYGSFICRCDPGYELEE 240
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db        181 CANVPGSYSCTCNPGFTLNEDGRSCQDVNECATENPCVQTCVNTYGSFICRCDPGYELEE 240

Qy        241 DGVHCSDMDECSFSEFLCQHECVNQPGTYFCSCPPGYILLDDNRSCQDINECEHRNHTCN 300
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db        241 DGVHCSDMDECSFSEFLCQHECVNQPGTYFCSCPPGYILLDDNRSCQDINECEHRNHTCN 300

Qy        301 LQQTCYNLQGGFKCIDPIRCEEPYLRISDNRCMCPAENPGCRDQPFTILYRDMDVVSGRS 360
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db        301 LQQTCYNLQGGFKCIDPIRCEEPYLRISDNRCMCPAENPGCRDQPFTILYRDMDVVSGRS 360

Qy        361 VPADIFQMQATTRYPGAYYIFQIKSGNEGREFYMRQTGPISATLVMTRPIKGPREIQLDL 420
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db        361 VPADIFQMQATTRYPGAYYIFQIKSGNEGREFYMRQTGPISATLVMTRPIKGPREIQLDL 420

Qy        421 EMITVNTVINFRGSSVIRLRIYVSQYPF 448
              ||||||||||||||||||||||||||||
Db        421 EMITVNTVINFRGSSVIRLRIYVSQYPF 448





RESULT 5 
AAY54989
ID AAY54989 standard; Protein; 448 AA.
XX
AC AAY54989;
XX
DT 15-FEB-2000 (first entry)
XX
DE Full length human A55 protein sequence.
XX
KW A55 protein; human; smooth muscle proliferation; tissue generation;
KW vascular smooth muscle cell; arteriosclerosis; tissue reparation; myoma;
KW vascular endothelial thickening; haematopoietic cell-regulator; cytokine;
KW percutaneous transluminal coronary angioplasty; blood coagulation; PTCA;
KW actin; inhibin; chemotaxis; thrombosis; cadherin; therapy;
KW tumour metastasis inhibitor.
XX
OS Homo sapiens.
XX
PN WO9955864-A1.
XX
PD 04-NOV-1999.
XX
PF 28-APR-1999; 99WO-JP02284.
XX
PR 28-APR-1998; 98JP-0119731.
XX
PA (ONOY ) ONO PHARM CO LTD.
XX
PI Honjo T, Tashiro K, Nakamura T;
XX
DR WPI; 2000-038647/03.
DR N-PSDB; AAZ40027.
XX
PT Novel human polypeptides for treatment of, e.g. arteriosclerosis and
PT myoma -
XX
PS Claim 1; Page 76-80; 87pp; Japanese.
XX
CC This sequence is the human A55 protein of the invention. The protein
CC can be used for the treatment of diseases due to abnormal proliferation
CC of smooth muscle. The polypeptides can be used according to their inhibition
CC of the proliferation of vascular smooth muscle cells, particularly in
CC treating arteriosclerosis or re-narrowing by vascular endothelial
CC thickening after percutaneous transluminal coronary angioplasty (PTCA),
CC or myoma, haematopoietic cell-regulatory activity, cytokine activity,
CC tissue generation/reparation activity, actin/inhibin activity, taxis
CC and chemotaxis activity, blood coagulation/thrombotic activity,
CC receptor/ligand activity, cadherin/tumour metastasis inhibiting activity,
CC tumour inhibition, and as a nutrient.

XX 

SQ Sequence 448 AA; 

Query Match 100.0%; Score 2533; DB 21; Length 448;
Best Local Similarity 100.0%; Pred. No. 4e-159;
Matches 448; Conservative 0; Mismatches 0; Indels 0; Gaps 0;

Qy          1 MPGIKRILTVTILALCLPSPGNAQAQCTNGFDLDRQSGQCLDIDECRTIPEACRGDMMCV 60
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db          1 MPGIKRILTVTILALCLPSPGNAQAQCTNGFDLDRQSGQCLDIDECRTIPEACRGDMMCV 60

Qy         61 NQNGGYLCIPRTNPVYRGPYSNPYSTPYSGPYPAAAPPLSAPNYPTISRPLICRFGYQMD 120
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db         61 NQNGGYLCIPRTNPVYRGPYSNPYSTPYSGPYPAAAPPLSAPNYPTISRPLICRFGYQMD 120

Qy        121 ESNQCVDVDECATDSHQCNPTQICINTEGGYTCSCTDGYWLLEGQCLDIDECRYGYCQQL 180
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db        121 ESNQCVDVDECATDSHQCNPTQICINTEGGYTCSCTDGYWLLEGQCLDIDECRYGYCQQL 180

Qy        181 CANVPGSYSCTCNPGFTLNEDGRSCQDVNECATENPCVQTCVNTYGSFICRCDPGYELEE 240
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db        181 CANVPGSYSCTCNPGFTLNEDGRSCQDVNECATENPCVQTCVNTYGSFICRCDPGYELEE 240

Qy        241 DGVHCSDMDECSFSEFLCQHECVNQPGTYFCSCPPGYILLDDNRSCQDINECEHRNHTCN 300
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db        241 DGVHCSDMDECSFSEFLCQHECVNQPGTYFCSCPPGYILLDDNRSCQDINECEHRNHTCN 300

Qy        301 LQQTCYNLQGGFKCIDPIRCEEPYLRISDNRCMCPAENPGCRDQPFTILYRDMDVVSGRS 360
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db        301 LQQTCYNLQGGFKCIDPIRCEEPYLRISDNRCMCPAENPGCRDQPFTILYRDMDVVSGRS 360

Qy        361 VPADIFQMQATTRYPGAYYIFQIKSGNEGREFYMRQTGPISATLVMTRPIKGPREIQLDL 420
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db        361 VPADIFQMQATTRYPGAYYIFQIKSGNEGREFYMRQTGPISATLVMTRPIKGPREIQLDL 420

Qy        421 EMITVNTVINFRGSSVIRLRIYVSQYPF 448
              ||||||||||||||||||||||||||||
Db        421 EMITVNTVINFRGSSVIRLRIYVSQYPF 448





RESULT 6 




AAM93573 




ID AAM93573 standard; Protein; 448 AA.
XX
AC AAM93573;
XX
DT 06-NOV-2001 (first entry)
XX
DE Human polypeptide, SEQ ID NO: 3357.
XX
KW Human; full length cDNA; cDNA synthesis; oligo-capping.
XX
OS Homo sapiens.
XX
PN EP1130094-A2.
XX
PD 05-SEP-2001.
XX
PF 07-JUL-2000; 2000EP-0114089.
XX
PR 08-JUL-1999; 99JP-0194486.
PR 11-JAN-2000; 2000JP-0118774.
PR 02-MAY-2000; 2000JP-0183765.
XX
PA (HELI-) HELIX RES INST.
XX
PI Ota T, Nishikawa T, Isogai T, Hayashi K, Ishii S, Kawai Y,
PI Wakamatsu A, Sugiyama T, Nagai K, Kojima S, Otsuki T, Koga H;
XX
DR WPI; 2001-524255/58.
DR N-PSDB; AAK94505.
XX
PT 830 Primers useful for synthesizing full length cDNA clones and their
PT use in genetic manipulation -
XX
PS Claim 8; SEQ ID NO 3357; 1380pp + sequence listing; English.
XX
CC The invention relates to primers for synthesising full length cDNA
CC clones. 830 cDNA molecules encoding a human protein have been
CC isolated and nucleotide sequences of 5'- and 3'-ends of the cDNA
CC molecules have been determined. Primers for synthesising the full length
CC cDNA are useful for clarifying the function of the protein encoded by
CC the cDNA. The full length clones were obtained by construction of full
CC length enriched cDNA libraries that were synthesised by the oligo-capping
CC method. The primers enable the production of the full length cDNA easily
CC without any special methods. The present sequence is a polypeptide
CC encoded by a full length human cDNA of the invention.
CC Note: The sequence data for this patent did not form part of the printed
CC specification, but was obtained in CD-ROM format directly from EPO.

XX 

SQ Sequence 448 AA;

Query Match 100.0%; Score 2533; DB 22; Length 448;
Best Local Similarity 100.0%; Pred. No. 4e-159;
Matches 448; Conservative 0; Mismatches 0; Indels 0; Gaps 0;

Qy          1 MPGIKRILTVTILALCLPSPGNAQAQCTNGFDLDRQSGQCLDIDECRTIPEACRGDMMCV 60
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db          1 MPGIKRILTVTILALCLPSPGNAQAQCTNGFDLDRQSGQCLDIDECRTIPEACRGDMMCV 60

Qy         61 NQNGGYLCIPRTNPVYRGPYSNPYSTPYSGPYPAAAPPLSAPNYPTISRPLICRFGYQMD 120
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db         61 NQNGGYLCIPRTNPVYRGPYSNPYSTPYSGPYPAAAPPLSAPNYPTISRPLICRFGYQMD 120

Qy        121 ESNQCVDVDECATDSHQCNPTQICINTEGGYTCSCTDGYWLLEGQCLDIDECRYGYCQQL 180
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db        121 ESNQCVDVDECATDSHQCNPTQICINTEGGYTCSCTDGYWLLEGQCLDIDECRYGYCQQL 180

Qy        181 CANVPGSYSCTCNPGFTLNEDGRSCQDVNECATENPCVQTCVNTYGSFICRCDPGYELEE 240
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db        181 CANVPGSYSCTCNPGFTLNEDGRSCQDVNECATENPCVQTCVNTYGSFICRCDPGYELEE 240

Qy        241 DGVHCSDMDECSFSEFLCQHECVNQPGTYFCSCPPGYILLDDNRSCQDINECEHRNHTCN 300
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db        241 DGVHCSDMDECSFSEFLCQHECVNQPGTYFCSCPPGYILLDDNRSCQDINECEHRNHTCN 300

Qy        301 LQQTCYNLQGGFKCIDPIRCEEPYLRISDNRCMCPAENPGCRDQPFTILYRDMDVVSGRS 360
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db        301 LQQTCYNLQGGFKCIDPIRCEEPYLRISDNRCMCPAENPGCRDQPFTILYRDMDVVSGRS 360

Qy        361 VPADIFQMQATTRYPGAYYIFQIKSGNEGREFYMRQTGPISATLVMTRPIKGPREIQLDL 420
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db        361 VPADIFQMQATTRYPGAYYIFQIKSGNEGREFYMRQTGPISATLVMTRPIKGPREIQLDL 420

Qy        421 EMITVNTVINFRGSSVIRLRIYVSQYPF 448
              ||||||||||||||||||||||||||||
Db        421 EMITVNTVINFRGSSVIRLRIYVSQYPF 448



RESULT 7 
AAU75494
ID AAU75494 standard; Protein; 448 AA.
XX
AC AAU75494;
XX
DT 23-APR-2002 (first entry)
XX
DE Human extracellular protein-like/EGF-like protein, EEGF.
XX
KW Human; extracellular protein-like protein; EGF-like;
KW protein; epidermal growth factor; EEGF; ATCC 97285; gene therapy;
KW vascular smooth muscle cell proliferation; Marfan syndrome;
KW wound healing; neurological trauma; acquired immunodeficiency syndrome;
KW AIDS-related dementia; ocular disorder; kidney disorder; liver disorder;
KW hair follicle growth promotion; burn; ulcer; corneal incision;
KW corneal inflammation; neoplasm; psoriasis.
XX
OS Homo sapiens.
XX
FH   Key             Location/Qualifiers
FT   Peptide         1..25
FT                   /label= Signal_peptide
FT   Protein         26..448
FT                   /label= Mature_EEGF
FT                   /note= "This region is specifically claimed in claim 10"
FT   Domain          112..153
FT                   /label= EGF_1_domain
FT                   /note= "This domain is specifically claimed in claim 10"
FT   Domain          154..190
FT                   /label= EGF_2_domain
FT                   /note= "This domain is specifically claimed in claim 10"
FT   Domain          191..230
FT                   /label= EGF_3_domain
FT                   /note= "This domain is specifically claimed in claim 10"
FT   Domain          231..271
FT                   /label= EGF_4_domain
FT                   /note= "This domain is specifically claimed in claim 10"
FT   Domain          272..314
FT                   /label= EGF_5_domain
FT                   /note= "This domain is specifically claimed in claim 10"
XX
PN US2001051358-A1.
XX
PD 13-DEC-2001.
XX
PF 25-MAR-1999; 99US-0275805.
XX
PR 11-APR-1997; 97US-0839525.
PR 10-APR-1996; 96WO-US05247.
XX
PA (OLSE/) OLSEN H S.
PA (LIHH/) LI H.
XX
PI Olsen HS, Li H;
XX
DR WPI; 2002-121417/16.
DR N-PSDB; ABK13627.
XX
PT New nucleic acid encoding human extracellular/epidermal growth factor,
PT useful for diagnosis and treatment of e.g. Marfan syndrome and wounds,
PT also related polypeptides -
XX
PS Claim 10; Fig 1; 22pp; English.
XX
CC The invention relates to a novel polynucleotide which is at
CC least 95% identical with a sequence (ATCC 97285) encoding mature human
CC extracellular protein-like/epidermal growth factor (EGF)-like protein,
CC EEGF. Also included are the EEGF EGF domains, a vector containing
CC the polynucleotide, a host cell containing the vector, anti-EEGF
CC antibodies and antagonists of EEGF. The polynucleotide is used for
CC recombinant production of EEGF, in gene therapy, as hybridisation probes,
CC as antisense antagonists and for chromosome identification. The protein
CC is used to treat patients who require EEGF, to identify specific
CC antagonists, used to treat conditions that require inhibition of EEGF
CC (e.g. vascular smooth muscle cell proliferation, Marfan syndrome, wound
CC healing, neurological trauma, acquired immunodeficiency syndrome
CC (AIDS)-related dementia, ocular disorders, kidney disorders, liver
CC disorders, hair follicle growth promotion, burns, ulcers, corneal
CC incisions, corneal inflammation, neoplasms and psoriasis), to raise
CC specific antibodies and to characterise receptors. The present
CC sequence represents EEGF.

XX 

SQ Sequence 448 AA; 



Query Match 100.0%; Score 2533; DB 23; Length 448; 

Best Local Similarity 100.0%; Pred. No. 4e-159; 

Matches 448; Conservative 0; Mismatches 0; Indels 0; Gaps 0;

Qy   1 MPGIKRILTVTILALCLPSPGNAQAQCTNGFDLDRQSGQCLDIDECRTIPEACRGDMMCV 60
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db   1 MPGIKRILTVTILALCLPSPGNAQAQCTNGFDLDRQSGQCLDIDECRTIPEACRGDMMCV 60

Qy  61 NQNGGYLCIPRTNPVYRGPYSNPYSTPYSGPYPAAAPPLSAPNYPTISRPLICRFGYQMD 120
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  61 NQNGGYLCIPRTNPVYRGPYSNPYSTPYSGPYPAAAPPLSAPNYPTISRPLICRFGYQMD 120

Qy 121 ESNQCVDVDECATDSHQCNPTQICINTEGGYTCSCTDGYWLLEGQCLDIDECRYGYCQQL 180
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 121 ESNQCVDVDECATDSHQCNPTQICINTEGGYTCSCTDGYWLLEGQCLDIDECRYGYCQQL 180

Qy 181 CANVPGSYSCTCNPGFTLNEDGRSCQDVNECATENPCVQTCVNTYGSFICRCDPGYELEE 240
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 181 CANVPGSYSCTCNPGFTLNEDGRSCQDVNECATENPCVQTCVNTYGSFICRCDPGYELEE 240

Qy 241 DGVHCSDMDECSFSEFLCQHECVNQPGTYFCSCPPGYILLDDNRSCQDINECEHRNHTCN 300
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 241 DGVHCSDMDECSFSEFLCQHECVNQPGTYFCSCPPGYILLDDNRSCQDINECEHRNHTCN 300

Qy 301 LQQTCYNLQGGFKCIDPIRCEEPYLRISDNRCMCPAENPGCRDQPFTILYRDMDVVSGRS 360
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 301 LQQTCYNLQGGFKCIDPIRCEEPYLRISDNRCMCPAENPGCRDQPFTILYRDMDVVSGRS 360

Qy 361 VPADIFQMQATTRYPGAYYIFQIKSGNEGREFYMRQTGPISATLVMTRPIKGPREIQLDL 420
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 361 VPADIFQMQATTRYPGAYYIFQIKSGNEGREFYMRQTGPISATLVMTRPIKGPREIQLDL 420

Qy 421 EMITVNTVINFRGSSVIRLRIYVSQYPF 448
       ||||||||||||||||||||||||||||
Db 421 EMITVNTVINFRGSSVIRLRIYVSQYPF 448

RESULT 8 
AAY08063 

ID AAY08063 standard; Protein; 448 AA. 
XX 

AC AAY08063; 
XX 

DT 11-SEP-2000 (first entry)
XX 

DE Human EGF-like homologue protein (PRO217) encoded by DNA32279 cDNA.
XX 

KW Inflammatory cell infiltration; immune response; T cell proliferation; 

KW anti-inflammatory; anti-autoimmune; anti-diabetic; spondyloarthropathy;

KW T cell-mediated disease; spondyloarthropathy; sclerosis; renal disease; 

KW inflammatory myopathy; hemolytic anemia; thrombocytopenia; thyroiditis; 

KW diabetes mellitus; demyelinating polyneuropathy; Guillain-Barre syndrome; 

KW multiple sclerosis; polyneuropathy; hepatitis; cirrhosis; enteropathy; 

KW sclerosing cholangitis; inflammatory bowel disease; Whipple's disease; 

KW skin disease; dermatitis; psoriasis; asthma; allergic rhinitis; tumor; 

KW food hypersensitivity; urticaria; eosinophilic pneumonia; transplant; 

KW idiopathic pulmonary fibrosis; graft rejection; PRO245; human; PRO217;

KW EGF-like. 
XX 

OS Homo sapiens.
XX 

PN WO9914241-A2.
XX 

PD 25-MAR-1999. 
XX 

PF 17-SEP-1998; 98WO-US19437.
XX 

PR 17-SEP-1997; 97US-0059119. 

PR 18-SEP-1997; 97US-0059263.

PR 28-OCT-1997; 97US-0063550.

PR 12-NOV-1997; 97US-0065186.

PR 21-NOV-1997; 97US-0066364.

PR 24-NOV-1997; 97US-0066770.

PR 04-JUN-1998; 98US-0088026.
XX 

PA (GETH ) GENENTECH INC. 
XX 

PI Fong S, Goddard A, Gurney AL, Tumas D, Wood WI; 
XX 

DR WPI; 1999-229499/19. 

DR N-PSDB; AAX37670. 



XX 

PT Composition containing novel polypeptide PRO245, its agonist or

PT antagonist - 

XX 

PS Example 1; Fig 6A; 177pp; English.
XX 

CC This invention describes a novel composition containing (apart from a 

CC carrier or excipient), a novel PRO245 polypeptide (I), its agonist or

CC antagonist, or their fragments, for modulating: (i) infiltration of 

CC inflammatory cells into tissue; (ii) an immune response; or (iii) T cell 

CC proliferation. The composition increases or decreases any of the effects 

CC (i)-(iii). The products of the invention have anti-inflammatory,

CC anti-autoimmune and anti-diabetic activity. (I), and its (ant)agonists

CC and their fragments, are used to treat immune-related diseases,

CC particularly T cell-mediated diseases. The diseases treated include 

CC systemic lupus erythematosus, rheumatoid arthritis, juvenile chronic 

CC arthritis, spondyloarthropathies, systemic sclerosis (scleroderma), 

CC idiopathic inflammatory myopathies (dermatomyositis, polymyositis),

CC Sjogren's syndrome, systemic vasculitis, sarcoidosis, autoimmune 

CC hemolytic anemia (immune pancytopenia, paroxysmal nocturnal 

CC hemoglobinuria), autoimmune thrombocytopenia (idiopathic thrombocytopenic

CC purpura immune-mediated thrombocytopenia), thyroiditis (Grave's disease, 

CC Hashimoto's thyroiditis, juvenile lymphocytic thyroiditis, atrophic 

CC thyroiditis), diabetes mellitus, immune-mediated renal disease

CC (glomerulonephritis, tubulointerstitial nephritis), multiple sclerosis, 

CC idiopathic demyelinating polyneuropathy, Guillain-Barre syndrome, chronic 

CC inflammatory demyelinating polyneuropathy, infectious hepatitis 

CC (hepatitis A, B, C, D, E and other non-hepatotropic viruses), autoimmune

CC chronic active hepatitis, primary biliary cirrhosis, granulomatous 

CC hepatitis, and sclerosing cholangitis, inflammatory bowel disease 

CC (ulcerative colitis: Crohn's disease), gluten-sensitive enteropathy, and 

CC Whipple's disease. Autoimmune or immune-mediated skin diseases including

CC bullous skin diseases, erythema multiforme, contact dermatitis, psoriasis, 

CC asthma, allergic rhinitis, atopic dermatitis, food hypersensitivity, 

CC urticaria, eosinophilic pneumonia, idiopathic pulmonary fibrosis, 

CC hypersensitivity pneumonitis, and transplantation associated diseases 

CC (graft rejection, and graft-versus-host-disease). (I), its (ant)agonists

CC or fragment can also be used as an adjuvant in treatment of tumors. 

CC Antibodies against (I) can also be used for diagnosing such diseases. 

CC This sequence represents a human EGF-like homologue encoded by cDNA clone 

CC DNA32279 which is described in the invention. 

XX 

SQ Sequence 448 AA; 

Query Match 99.8%; Score 2527; DB 20; Length 448; 

Best Local Similarity 99.8%; Pred. No. 9.9e-159;

Matches 447; Conservative 0; Mismatches 1; Indels 0; Gaps 0; 

Qy   1 MPGIKRILTVTILALCLPSPGNAQAQCTNGFDLDRQSGQCLDIDECRTIPEACRGDMMCV 60
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db   1 MPGIKRILTVTILALCLPSPGNAQAQCTNGFDLDRQSGQCLDIDECRTIPEACRGDMMCV 60

Qy  61 NQNGGYLCIPRTNPVYRGPYSNPYSTPYSGPYPAAAPPLSAPNYPTISRPLICRFGYQMD 120
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  61 NQNGGYLCIPRTNPVYRGPYSNPYSTPYSGPYPAAAPPLSAPNYPTISRPLICRFGYQMD 120

Qy 121 ESNQCVDVDECATDSHQCNPTQICINTEGGYTCSCTDGYWLLEGQCLDIDECRYGYCQQL 180
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 121 ESNQCVDVDECATDSHQCNPTQICINTEGGYTCSCTDGYWLLEGQCLDIDECRYGYCQQL 180

Qy 181 CANVPGSYSCTCNPGFTLNEDGRSCQDVNECATENPCVQTCVNTYGSFICRCDPGYELEE 240
       ||||||||||||||||||||||||||||||||||||||||||||||| ||||||||||||
Db 181 CANVPGSYSCTCNPGFTLNEDGRSCQDVNECATENPCVQTCVNTYGSLICRCDPGYELEE 240

Qy 241 DGVHCSDMDECSFSEFLCQHECVNQPGTYFCSCPPGYILLDDNRSCQDINECEHRNHTCN 300
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 241 DGVHCSDMDECSFSEFLCQHECVNQPGTYFCSCPPGYILLDDNRSCQDINECEHRNHTCN 300

Qy 301 LQQTCYNLQGGFKCIDPIRCEEPYLRISDNRCMCPAENPGCRDQPFTILYRDMDVVSGRS 360
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 301 LQQTCYNLQGGFKCIDPIRCEEPYLRISDNRCMCPAENPGCRDQPFTILYRDMDVVSGRS 360

Qy 361 VPADIFQMQATTRYPGAYYIFQIKSGNEGREFYMRQTGPISATLVMTRPIKGPREIQLDL 420
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 361 VPADIFQMQATTRYPGAYYIFQIKSGNEGREFYMRQTGPISATLVMTRPIKGPREIQLDL 420

Qy 421 EMITVNTVINFRGSSVIRLRIYVSQYPF 448
       ||||||||||||||||||||||||||||
Db 421 EMITVNTVINFRGSSVIRLRIYVSQYPF 448

RESULT 9 
AAU29227 

ID AAU29227 standard; Protein; 448 AA. 
XX 

AC AAU29227; 
XX 

DT 18-DEC-2001 (first entry) 
XX 

DE Human PRO polypeptide sequence #204. 
XX 

KW PRO polypeptide; mammal; tumour; cancer; human; cattle; horse; sheep; 

KW dog; cat; pig; goat; rabbit; tumour necrosis factor alpha; TNF-alpha; 

KW blood; chondrocyte cell; cell proliferation; cell differentiation; colon; 

KW adrenal; lung; breast; prostate; rectum; cervix; liver; genetic disorder. 
XX 

OS Homo sapiens.
XX 

PN WO200168848-A2. 
XX 

PD 20-SEP-2001. 
XX 

PF 28-FEB-2001; 2001WO-US06520.
XX 

PR 01-MAR-2000; 2000WO-US05601.

PR 02-MAR-2000; 2000WO-US05841.

PR 03-MAR-2000; 2000US-187202P.

PR 06-MAR-2000; 2000US-186968P.

PR 14-MAR-2000; 2000US-189320P.

PR 14-MAR-2000; 2000US-189328P.

PR 15-MAR-2000; 2000WO-US06884.

PR 21-MAR-2000; 2000US-190828P.

PR 21-MAR-2000; 2000US-191007P.

PR 21-MAR-2000; 2000US-191048P.



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Pan J, Smith V, Watanabe CK 
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Novel nucleic acids encoding PRO polypeptides, used to diagnose the 
PT presence of tumours, such as prostate and breast tumours, in mammals and
PT to screen for modulators of the compounds -
XX
PS Claim 11; Fig 408; 774pp; English.
XX
CC Sequences AAU29024-AAU29328 represent PRO polypeptides of the invention.
CC The PRO polypeptides and their associated nucleic acids can be used to
CC detect the presence of a tumour in a mammal by comparing the level of
CC expression of a PRO polypeptide in a test sample of cells from the animal
CC and a control sample of normal cells, whereby a higher level of
CC expression in the test sample indicates the presence of a tumour in the
CC mammal. Mammals include dogs, cats, cattle, horses, sheep, pigs, goats
CC and rabbits but are preferably human. The polypeptides can be used to
CC stimulate tumour necrosis factor (TNF) alpha release from human blood,
CC when contacted with it. A specific polypeptide can be used to stimulate
CC the proliferation or differentiation of chondrocyte cells. The PRO
CC proteins can be used to determine the presence of tumours and also
CC susceptibility to tumour development, particularly adrenal, lung, colon,



CC breast, prostate, rectal, cervical, or liver tumours, in mammalian

CC subjects. The oligonucleotide probes specific for the PRO nucleic acids 

CC can be used for genetic analysis of individuals with genetic disorders. 
XX 

SQ Sequence 448 AA; 



Query Match 99.8%; Score 2527; DB 22; Length 448; 

Best Local Similarity 99.8%; Pred. No. 9.9e-159; 

Matches 447; Conservative 0; Mismatches 1; Indels 0; Gaps 0; 
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420 


Db 361 420

Qy 421 EMITVNTVINFRGSSVIRLRIYVSQYPF 448
       ||||||||||||||||||||||||||||
Db 421 EMITVNTVINFRGSSVIRLRIYVSQYPF 448

RESULT 10 
AAB31183 

ID AAB31183 standard; Protein; 448 AA. 
XX 

AC AAB31183; 
XX 

DT 20-APR-2001 (first entry) 
XX 

DE Amino acid sequence of human polypeptide PRO210.
XX 

KW Human; secreted protein; transmembrane protein; PRO196; PRO444; PRO183;

KW PRO185; PRO210; PRO215; PRO217; PRO242; PRO288; PRO365; PRO1361; PRO1308;

KW PRO1183; PRO1272; PRO1419; PRO4999; PRO7170; PRO248; PRO353; PRO1318;

KW PRO1600; PRO9940; PRO533; PRO301; PRO187; PRO337; PRO1411; PRO4356;

KW PRO246; PRO265; PRO941; PRO10096; PRO6003; PRO6004; PRO350; PRO2630;

KW PRO6309; cell death; genetic disorder; transgenic animal; gene therapy.

XX 

OS Homo sapiens.
XX 
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FT Modified-site 
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FT Modified-site 
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Location/Qualifiers 
1..25
/note= "signal peptide"
21..27
/note= "N-myristoylation site"
54..57
/note= "cell attachment site"
64..70
/note= "N-myristoylation site"
144..156
/note= "aspartic acid and asparagine hydroxylation site"
149..155
/note= "N-myristoylation site"
186..192
/note= "N-myristoylation site"
226..232
/note= "N-myristoylation site"
242..248
/note= "N-myristoylation site"
267..273
/note= "N-myristoylation site"
283..287
/note= "N-glycosylation site"
296..300
/note= "N-glycosylation site"
310..316
/note= "N-myristoylation site"



PR 20-MAR-2000; 2000WO-US07377.

PR 30-MAR-2000; 2000WO-US08439.

PR 15-MAY-2000; 2000WO-US13358.

PR 17-MAY-2000; 2000WO-US13705.
XX 

PA (GETH ) GENENTECH INC. 
XX 

PI Ashkenazi AJ, Baker KP, Botstein DA, Desnoyers L, Eaton DL;

PI Ferrara N, Fong S, Gao W, Gerber H, Gerritsen ME, Goddard A; 

PI Godowski PJ, Gurney AL, Kljavin IJ, Mather JP, Napier MA, Pan J;

PI Paoni NF, Roy MA, Stewart TA, Tumas D, Watanabe CK, Williams PM; 

PI Wood WI, Zhang Z; 

XX 

DR WPI; 2001-050091/06. 

DR N-PSDB; AAC86968. 
XX 

PT Isolated nucleic acid molecule encoding a PRO polypeptide which is a 

PT transmembrane polypeptide is useful for gene therapy and identification 

PT of related polypeptides - 
XX 

PS Claim 12; Fig 10; 244pp; English. 
XX 

CC The present sequence represents a human secreted and transmembrane 

CC polypeptide. The specification describes human polypeptides, designated 

CC PRO196, PRO444, PRO183, PRO185, PRO210, PRO215, PRO217, PRO242, PRO288,

CC PRO365, PRO1361, PRO1308, PRO1183, PRO1272, PRO1419, PRO4999, PRO7170,

CC PRO248, PRO353, PRO1318, PRO1600, PRO9940, PRO533, PRO301, PRO187,

CC PRO337, PRO1411, PRO4356, PRO246, PRO265, PRO941, PRO10096, PRO6003,

CC PRO6004, PRO350, PRO2630 and PRO6309. The biological activity of cells

CC can be modulated with agents that bind to these polypeptides, resulting 

CC in the death of the cells. The polynucleotides encoding these 

CC polypeptides are useful in the recombinant production of the 

CC polypeptides, as a hybridisation probe to screen libraries to isolate 

CC homologous sequences, or to map the gene. They may also be used for 

CC analysing genetic disorders, and to produce transgenic animals which are 

CC useful for the development and screening of therapeutically useful 

CC reagents. The polynucleotides can also be used in gene therapy e.g. to 

CC replace a defective gene. 

XX 

SQ Sequence 448 AA; 



Query Match 99.8%; Score 2527; DB 22; Length 448; 

Best Local Similarity 99.8%; Pred. No. 9.9e-159; 

Matches 447; Conservative 0; Mismatches 1; Indels 0; Gaps 0;

Qy   1 MPGIKRILTVTILALCLPSPGNAQAQCTNGFDLDRQSGQCLDIDECRTIPEACRGDMMCV 60
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Db 1 MPGI KRI LTVTI LALCLPSPGNAQAQCTNGFDLDRQSGQCLDIDECRTI PEACRGDMMCV 60 

Qy  61 NQNGGYLCIPRTNPVYRGPYSNPYSTPYSGPYPAAAPPLSAPNYPTISRPLICRFGYQMD 120
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  61 NQNGGYLCIPRTNPVYRGPYSNPYSTPYSGPYPAAAPPLSAPNYPTISRPLICRFGYQMD 120

Qy 121 ESNQCVDVDECATDSHQCNPTQICINTEGGYTCSCTDGYWLLEGQCLDIDECRYGYCQQL 180
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360 


Qy 361 VPADIFQMQATTRYPGAYYIFQIKSGNEGREFYMRQTGPISATLVMTRPIKGPREIQLDL
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ABU71315 

ID ABU71315 standard; Protein; 448 AA. 
XX 

AC ABU71315; 
XX 

DT 10-JUN-2003 (first entry) 
XX 

DE Human PRO210 protein. 
XX 



KW Human; PRO; secreted; transmembrane; cytostatic; TNF-alpha; blood; 

KW tumour necrosis factor alpha release; chondrocyte cell; proliferation; 

KW differentiation; tumour; gene therapy. 
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OS Homo sapiens . 
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CC The invention describes an isolated, secreted and transmembrane 

CC polypeptide (PP), termed PRO PP or fibroblast growth factor receptor PP

CC (I). (I) is useful for detecting PRO533, PRO301, PRO187, PRO337,

CC PRO1411, PRO10096, PRO246, PRO6307, PRO6003, fibroblast growth factor
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RESULT 14 
ABU66105 

ID ABU66105 standard; Protein; 448 AA. 
XX 

AC ABU66105; 



XX 

DT 20-MAY-2003 (first entry) 
XX 

DE Novel human secreted and transmembrane protein PRO210. 
XX 

KW Human; secreted protein; transmembrane protein; cytostatic; 

KW gene therapy; TNF-Agonist-Alpha; chondrocyte stimulator; tumour;

KW adrenal tumour; lung tumour; colon tumour; breast tumour; 

KW prostate tumour; rectal tumour; cervical tumour; liver tumour. 

XX 

OS Homo sapiens.
XX 

PN US2003036157-A1. 
XX 

PD 20-FEB-2003. 
XX 

PF 02-JUL-2002; 2002US-0188769.
XX 

PR 16-SEP-1998; 98WO-US19330.

PR 07-OCT-1998; 98WO-US21141.

PR 01-DEC-1998; 98WO-US25108.

PR 08-MAR-1999; 99WO-US05028.

PR 14-MAY-1999; 99WO-US10733.

PR 02-JUN-1999; 99WO-US12252.

PR 01-SEP-1999; 99WO-US20111.
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PR 01-DEC-1999; 99WO-US28301 . 
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PR 20-JUN-2 001; 2001WO-US19692 . 
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PR 09-JUL-2001; 2001WO-US21735 . 
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Query Match 99.8%; Score 2527; DB 24; Length 448; 

Best Local Similarity 99.8%; Pred. No. 9.9e-159; 

Matches 447; Conservative 0; Mismatches 1; Indels 0; Gaps 0; 

Qy   1 MPGIKRILTVTILALCLPSPGNAQAQCTNGFDLDRQSGQCLDIDECRTIPEACRGDMMCV 60
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db   1 MPGIKRILTVTILALCLPSPGNAQAQCTNGFDLDRQSGQCLDIDECRTIPEACRGDMMCV 60

Qy  61 NQNGGYLCIPRTNPVYRGPYSNPYSTPYSGPYPAAAPPLSAPNYPTISRPLICRFGYQMD 120
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  61 NQNGGYLCIPRTNPVYRGPYSNPYSTPYSGPYPAAAPPLSAPNYPTISRPLICRFGYQMD 120

Qy 121 ESNQCVDVDECATDSHQCNPTQICINTEGGYTCSCTDGYWLLEGQCLDIDECRYGYCQQL 180
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 121 ESNQCVDVDECATDSHQCNPTQICINTEGGYTCSCTDGYWLLEGQCLDIDECRYGYCQQL 180

Qy 181 CANVPGSYSCTCNPGFTLNEDGRSCQDVNECATENPCVQTCVNTYGSFICRCDPGYELEE 240
       ||||||||||||||||||||||||||||||||||||||||||||||| ||||||||||||
Db 181 CANVPGSYSCTCNPGFTLNEDGRSCQDVNECATENPCVQTCVNTYGSLICRCDPGYELEE 240
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Db 361 VPADI FQMQATTRY PGAYY I FQI KSGNEGR EF YMRQTG P I SATLVMTRP I KGPRE I QLDL 420 



Qy 421 EMITVNTVINFRGSSVIRLRIYVSQYPF 448
       ||||||||||||||||||||||||||||
Db 421 EMITVNTVINFRGSSVIRLRIYVSQYPF 448

RESULT 15 
ABU67141 

ID ABU67141 standard; Protein; 448 AA. 
XX 

AC ABU67141; 
XX 

DT 28-MAY-2003 (first entry) 
XX 

DE Novel human secreted and transmembrane protein PRO210. 
XX 

KW Secreted and transmembrane polypeptide; PRO polypeptide; PRO533;

KW PRO301; PRO187; PRO337; PRO1411; PRO10096; PRO246; PRO6307; PRO6003;

KW PRO6004; PRO4356; PRO2630; PRO265; PRO941; FGFR; bioactive molecule;

KW fibroblast growth factor receptor; cell death; chromosome mapping; 

KW gene mapping; transgenic animal; knockout animal; gene therapy; 

KW tumour; obesity; diabetes; insulinaemia; vascular permeability;

KW cardiac insufficiency disorder; immune response; hearing loss; 

KW auditory hair cell regeneration; bone disorder; cartilage disorder; 

KW sports injury; arthritis. 

XX 

OS Homo sapiens. 
XX 

PN US2003032062-A1.
XX 
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XX 

PA (GETH ) GENENTECH INC.
XX 

PI Ashkenazi AJ, Baker KP, Botstein DA, Desnoyers L, Eaton DL;

PI Ferrara N, Fong S, Gao W, Gerber H, Gerritsen ME, Goddard A;

PI Godowski PJ, Gurney AL, Kljavin IJ, Mather JP, Napier MA, Pan J;

PI Paoni NF, Roy MA, Stewart TA, Tumas D, Watanabe CK, Williams PM;

PI Wood WI, Zhang Z; 

XX 

DR WPI; 2003-341963/32. 

DR N-PSDB; ACA04448.
XX 

PT New secreted and transmembrane polypeptide for modulating biological 

PT activity of a cell expressing the polypeptide, identifying agonists or

PT antagonists of the polypeptide, and as molecular weight markers - 
XX 

PS Claim 12; Fig 10; 254pp; English.
XX 

CC The invention describes an isolated, secreted and transmembrane 

CC polypeptide (I), termed PRO polypeptide. (I) is useful for detecting

CC PRO533, PRO301, PRO187, PRO337, PRO1411, PRO10096, PRO246, PRO6307,

CC PRO6003, PRO6004, PRO4356, PRO2630, PRO265, PRO941, fibroblast growth

CC factor receptor (FGFR)-4, FGFR-3, FGFR-2 or FGFR-1 polypeptide, and for

CC linking a bioactive molecule e.g. toxin, radiolabel or antibody, to a

CC cell expressing the polypeptides. The bioactive molecule causes cell

CC death. (II) is useful as hybridisation probes, in chromosome and gene

CC mapping, in generation of antisense RNA and DNA, in the preparation of

CC PRO polypeptide, for generating transgenic animals or knockout animals 

CC which in turn are useful in the development and screening of 

CC therapeutically useful reagents, and for the genetic analysis of 

CC individuals with genetic disorders, in gene therapy, and for chromosome 

CC identification. (I) or Ab is useful for the preparation of medicament for

CC treating conditions which are responsive to the PRO polypeptide or 



CC anti-PRO antibody e.g. a tumour. (I) is useful for treating obesity, 

CC diabetes or hypo- or hyper-insulinaemia, and cardiac insufficiency 

CC disorders, for inhibiting tumour growth, enhances vascular permeability 

CC and immune response, for inducing regeneration of auditory hair cells and 

CC for treating hearing loss in mammals, and for treating bone and/or 

CC cartilage disorders such as sports injuries and arthritis. This is the 

CC amino acid sequence of a novel human secreted and transmembrane 

CC polypeptide. 
XX 

SQ Sequence 448 AA; 



Query Match 99.8%; Score 2527; DB 24; Length 448; 

Best Local Similarity 99.8%; Pred. No. 9.9e-159; 

Matches 447; Conservative 0; Mismatches 1; Indels 0; Gaps 0;

Qy   1 MPGIKRILTVTILALCLPSPGNAQAQCTNGFDLDRQSGQCLDIDECRTIPEACRGDMMCV 60
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db   1 MPGIKRILTVTILALCLPSPGNAQAQCTNGFDLDRQSGQCLDIDECRTIPEACRGDMMCV 60

Qy  61 NQNGGYLCIPRTNPVYRGPYSNPYSTPYSGPYPAAAPPLSAPNYPTISRPLICRFGYQMD 120
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  61 NQNGGYLCIPRTNPVYRGPYSNPYSTPYSGPYPAAAPPLSAPNYPTISRPLICRFGYQMD 120

Qy 121 ESNQCVDVDECATDSHQCNPTQICINTEGGYTCSCTDGYWLLEGQCLDIDECRYGYCQQL 180
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 121 ESNQCVDVDECATDSHQCNPTQICINTEGGYTCSCTDGYWLLEGQCLDIDECRYGYCQQL 180

Qy 181 CANVPGSYSCTCNPGFTLNEDGRSCQDVNECATENPCVQTCVNTYGSFICRCDPGYELEE 240
       ||||||||||||||||||||||||||||||||||||||||||||||| ||||||||||||
Db 181 CANVPGSYSCTCNPGFTLNEDGRSCQDVNECATENPCVQTCVNTYGSLICRCDPGYELEE 240

Qy 241 DGVHCSDMDECSFSEFLCQHECVNQPGTYFCSCPPGYILLDDNRSCQDINECEHRNHTCN 300
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 241 DGVHCSDMDECSFSEFLCQHECVNQPGTYFCSCPPGYILLDDNRSCQDINECEHRNHTCN 300

Qy 301 LQQTCYNLQGGFKCIDPIRCEEPYLRISDNRCMCPAENPGCRDQPFTILYRDMDVVSGRS 360
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 301 LQQTCYNLQGGFKCIDPIRCEEPYLRISDNRCMCPAENPGCRDQPFTILYRDMDVVSGRS 360

Qy 361 VPADIFQMQATTRYPGAYYIFQIKSGNEGREFYMRQTGPISATLVMTRPIKGPREIQLDL 420
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 361 VPADIFQMQATTRYPGAYYIFQIKSGNEGREFYMRQTGPISATLVMTRPIKGPREIQLDL 420

Qy 421 EMITVNTVINFRGSSVIRLRIYVSQYPF 448
       ||||||||||||||||||||||||||||
Db 421 EMITVNTVINFRGSSVIRLRIYVSQYPF 448



Search completed: January 9, 2004, 12:33:33
Job time : 41.6051 secs



GenCore version 5.1.6 
Copyright (c) 1993 - 2004 Compugen Ltd. 



OM protein - protein search, using sw model

Run on: January 9, 2004, 12:30:17 ; Search time 15.9449 Seconds
(without alignments)
1188.799 Million cell updates/sec

Title: US-09-674-379A-13
Perfect score: 2533
Sequence: 1 MPGIKRILTVTILALCLPSP......INFRGSSVIRLRIYVSQYPF 448

Scoring table: BLOSUM62
Gapop 10.0 , Gapext 0.5

Searched: 328717 seqs, 42310858 residues

Total number of hits satisfying chosen parameters: 328717

Minimum DB seq length: 0
Maximum DB seq length: 2000000000

Post-processing: Minimum Match 0%
Maximum Match 100%
Listing first 45 summaries

Database : Issued_Patents_AA:*
1: /cgn2_6/ptodata/1/iaa/5A_COMB.pep:*
2: /cgn2_6/ptodata/1/iaa/5B_COMB.pep:*
3: /cgn2_6/ptodata/1/iaa/6A_COMB.pep:*
4: /cgn2_6/ptodata/1/iaa/6B_COMB.pep:*
5: /cgn2_6/ptodata/1/iaa/PCTUS_COMB.pep:*
6: /cgn2_6/ptodata/1/iaa/backfiles1.pep:*

Pred. No. is the number of results predicted by chance to have a 
score greater than or equal to the score of the result being printed, 
and is derived by analysis of the total score distribution. 
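
The percentage fields reported for each hit can be checked by hand. The sketch below is not part of the GenCore output; it assumes, consistently with the values in this listing, that "Query Match" is the hit score divided by the perfect score and that "Best Local Similarity" is the number of identical positions divided by the aligned length (matches + conservative + mismatches + indels). The function names are hypothetical.

```python
def query_match_percent(score: float, perfect_score: float) -> float:
    """Assumed definition: hit score as a percentage of the query's self-score."""
    return round(100.0 * score / perfect_score, 1)


def best_local_similarity_percent(matches: int, conservative: int,
                                  mismatches: int, indels: int = 0) -> float:
    """Assumed definition: identical positions as a percentage of aligned columns."""
    aligned = matches + conservative + mismatches + indels
    return round(100.0 * matches / aligned, 1)


if __name__ == "__main__":
    # Values taken from the hit summaries in this report.
    print(query_match_percent(2527, 2533))           # score 2527 against perfect score 2533
    print(query_match_percent(1827, 2533))           # score 1827 against perfect score 2533
    print(best_local_similarity_percent(319, 7, 9))  # 319 matches, 7 conservative, 9 mismatches
```

Under these assumptions the three calls reproduce the 99.8%, 72.1% and 95.2% figures printed for the corresponding hits.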
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ALIGNMENTS 



RESULT 1 
US-08-884-072-1 

; Sequence 1, Application US/08884072
; Patent No. 5872234
; GENERAL INFORMATION:
; APPLICANT: Bandman, Olga
; APPLICANT: Corley, Neil C.
; APPLICANT: Guegler, Karl J.
; TITLE OF INVENTION: HUMAN EXTRACELLULAR MATRIX PROTEINS
; NUMBER OF SEQUENCES: 6
; CORRESPONDENCE ADDRESS:
; ADDRESSEE: Incyte Pharmaceuticals, Inc.
; STREET: 3174 Porter Drive
; CITY: Palo Alto
; STATE: CA
; COUNTRY: USA
; ZIP: 94304
; COMPUTER READABLE FORM:
; MEDIUM TYPE: Diskette
; COMPUTER: IBM Compatible
; OPERATING SYSTEM: DOS
; SOFTWARE: FastSEQ for Windows Version 2.0
; CURRENT APPLICATION DATA:
; APPLICATION NUMBER: US/08/884,072
; FILING DATE: Herewith
; CLASSIFICATION: 424
; PRIOR APPLICATION DATA:
; APPLICATION NUMBER:
; FILING DATE:
; ATTORNEY/AGENT INFORMATION:
; NAME: Billings, Lucy J.
; REGISTRATION NUMBER: 36,749
; REFERENCE/DOCKET NUMBER: PF-0333 US
; TELECOMMUNICATION INFORMATION:
; TELEPHONE: 415-855-0555
; TELEFAX: 415-845-4166
; TELEX:
; INFORMATION FOR SEQ ID NO: 1:
; SEQUENCE CHARACTERISTICS:
; LENGTH: 448 amino acids
; TYPE: amino acid
; STRANDEDNESS: single
; TOPOLOGY: linear
; IMMEDIATE SOURCE:
; LIBRARY: CORNNOT01
; CLONE: 45517
US-08-884-072-1 

Query Match 100.0%; Score 2533; DB 2; Length 448; 

Best Local Similarity 100.0%; Pred. No. 3.2e-193;

Matches 448; Conservative 0; Mismatches 0; Indels 0; Gaps 0;

Qy   1 MPGIKRILTVTILALCLPSPGNAQAQCTNGFDLDRQSGQCLDIDECRTIPEACRGDMMCV 60
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db   1 MPGIKRILTVTILALCLPSPGNAQAQCTNGFDLDRQSGQCLDIDECRTIPEACRGDMMCV 60

Qy  61 NQNGGYLCIPRTNPVYRGPYSNPYSTPYSGPYPAAAPPLSAPNYPTISRPLICRFGYQMD 120
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  61 NQNGGYLCIPRTNPVYRGPYSNPYSTPYSGPYPAAAPPLSAPNYPTISRPLICRFGYQMD 120

Qy 121 ESNQCVDVDECATDSHQCNPTQICINTEGGYTCSCTDGYWLLEGQCLDIDECRYGYCQQL 180
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 121 ESNQCVDVDECATDSHQCNPTQICINTEGGYTCSCTDGYWLLEGQCLDIDECRYGYCQQL 180

Qy 181 CANVPGSYSCTCNPGFTLNEDGRSCQDVNECATENPCVQTCVNTYGSFICRCDPGYELEE 240
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 181 CANVPGSYSCTCNPGFTLNEDGRSCQDVNECATENPCVQTCVNTYGSFICRCDPGYELEE 240

Qy 241 DGVHCSDMDECSFSEFLCQHECVNQPGTYFCSCPPGYILLDDNRSCQDINECEHRNHTCN 300
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 241 DGVHCSDMDECSFSEFLCQHECVNQPGTYFCSCPPGYILLDDNRSCQDINECEHRNHTCN 300

Qy 301 LQQTCYNLQGGFKCIDPIRCEEPYLRISDNRCMCPAENPGCRDQPFTILYRDMDVVSGRS 360
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 301 LQQTCYNLQGGFKCIDPIRCEEPYLRISDNRCMCPAENPGCRDQPFTILYRDMDVVSGRS 360

Qy 361 VPADIFQMQATTRYPGAYYIFQIKSGNEGREFYMRQTGPISATLVMTRPIKGPREIQLDL 420
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 361 VPADIFQMQATTRYPGAYYIFQIKSGNEGREFYMRQTGPISATLVMTRPIKGPREIQLDL 420

Qy 421 EMITVNTVINFRGSSVIRLRIYVSQYPF 448
       ||||||||||||||||||||||||||||
Db 421 EMITVNTVINFRGSSVIRLRIYVSQYPF 448



RESULT 2 
US-09-212-168-1 

; Sequence 1, Application US/09212168
; Patent No. 6303765
; GENERAL INFORMATION:
; APPLICANT: Bandman, Olga
; APPLICANT: Corley, Neil C.
; APPLICANT: Guegler, Karl J.
; TITLE OF INVENTION: HUMAN EXTRACELLULAR MATRIX PROTEINS
; NUMBER OF SEQUENCES: 6
; CORRESPONDENCE ADDRESS:
; ADDRESSEE: Incyte Pharmaceuticals, Inc.
; STREET: 3174 Porter Drive
; CITY: Palo Alto
; STATE: CA
; COUNTRY: USA
; ZIP: 94304
; COMPUTER READABLE FORM:
; MEDIUM TYPE: Diskette
; COMPUTER: IBM Compatible
; OPERATING SYSTEM: DOS
; SOFTWARE: FastSEQ for Windows Version 2.0
; CURRENT APPLICATION DATA:
; APPLICATION NUMBER: US/09/212,168
; FILING DATE:
; CLASSIFICATION:
; PRIOR APPLICATION DATA:
; APPLICATION NUMBER: 08/884,072
; FILING DATE:
; ATTORNEY/AGENT INFORMATION:
; NAME: Billings, Lucy J.
; REGISTRATION NUMBER: 36,749
; REFERENCE/DOCKET NUMBER: PF-0333 US
; TELECOMMUNICATION INFORMATION:
; TELEPHONE: 415-855-0555
; TELEFAX: 415-845-4166
; TELEX:
; INFORMATION FOR SEQ ID NO: 1:
; SEQUENCE CHARACTERISTICS:
; LENGTH: 448 amino acids
; TYPE: amino acid
; STRANDEDNESS: single
; TOPOLOGY: linear
; IMMEDIATE SOURCE:
; LIBRARY: CORNNOT01
; CLONE: 45517
US-09-212-168-1 



Query Match 100.0%; Score 2533; DB 4; Length 448; 

Best Local Similarity 100.0%; Pred. No. 3.2e-193; 

Matches 448; Conservative 0; Mismatches 0; Indels 0; Gaps 0; 
Qy   1 MPGIKRILTVTILALCLPSPGNAQAQCTNGFDLDRQSGQCLDIDECRTIPEACRGDMMCV 60
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db   1 MPGIKRILTVTILALCLPSPGNAQAQCTNGFDLDRQSGQCLDIDECRTIPEACRGDMMCV 60

Qy  61 NQNGGYLCIPRTNPVYRGPYSNPYSTPYSGPYPAAAPPLSAPNYPTISRPLICRFGYQMD 120
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  61 NQNGGYLCIPRTNPVYRGPYSNPYSTPYSGPYPAAAPPLSAPNYPTISRPLICRFGYQMD 120

Qy 121 ESNQCVDVDECATDSHQCNPTQICINTEGGYTCSCTDGYWLLEGQCLDIDECRYGYCQQL 180
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 121 ESNQCVDVDECATDSHQCNPTQICINTEGGYTCSCTDGYWLLEGQCLDIDECRYGYCQQL 180

Qy 181 CANVPGSYSCTCNPGFTLNEDGRSCQDVNECATENPCVQTCVNTYGSFICRCDPGYELEE 240
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 181 CANVPGSYSCTCNPGFTLNEDGRSCQDVNECATENPCVQTCVNTYGSFICRCDPGYELEE 240

Qy 241 DGVHCSDMDECSFSEFLCQHECVNQPGTYFCSCPPGYILLDDNRSCQDINECEHRNHTCN 300
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 241 DGVHCSDMDECSFSEFLCQHECVNQPGTYFCSCPPGYILLDDNRSCQDINECEHRNHTCN 300

Qy 301 LQQTCYNLQGGFKCIDPIRCEEPYLRISDNRCMCPAENPGCRDQPFTILYRDMDVVSGRS 360
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 301 LQQTCYNLQGGFKCIDPIRCEEPYLRISDNRCMCPAENPGCRDQPFTILYRDMDVVSGRS 360

Qy 361 VPADIFQMQATTRYPGAYYIFQIKSGNEGREFYMRQTGPISATLVMTRPIKGPREIQLDL 420
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 361 VPADIFQMQATTRYPGAYYIFQIKSGNEGREFYMRQTGPISATLVMTRPIKGPREIQLDL 420

Qy 421 EMITVNTVINFRGSSVIRLRIYVSQYPF 448
       ||||||||||||||||||||||||||||
Db 421 EMITVNTVINFRGSSVIRLRIYVSQYPF 448



RESULT 3 

US-09-312-283C-186 

; Sequence 186, Application US/09312283C
; Patent No. 6573095
; GENERAL INFORMATION:
; APPLICANT: Watson, James D.
; APPLICANT: Strachan, Lorna
; APPLICANT: Sleeman, Matthew
; APPLICANT: Onrust, Rene
; APPLICANT: Murison, James G.
; APPLICANT: Kumble, Krishanand D.
; TITLE OF INVENTION: Compositions Isolated from Skin Cells
; TITLE OF INVENTION: and Methods for Their Use
; FILE REFERENCE: 11000.1011C2
; CURRENT APPLICATION NUMBER: US/09/312,283C
; CURRENT FILING DATE: 1999-05-14
; NUMBER OF SEQ ID NOS: 425
; SOFTWARE: FastSEQ for Windows Version 4.0
; SEQ ID NO 186
; LENGTH: 335
; TYPE: PRT
; ORGANISM: Mouse
US-09-312-283C-186 



Query Match 72.1%; Score 1827; DB 4; Length 335;

Best Local Similarity 95.2%; Pred. No. 2.1e-137; 

Matches 319; Conservative 7; Mismatches 9; Indels 0; Gaps 0; 



Qy 114 RFGYQMDESNQCVDVDECATDSHQCNPTQICINTEGGYTCSCTDGYWLLEGQCLDIDECR 173
       |||||||| |||||||||||||||||||||||||||||||||||||||||||||||||||
Db   1 RFGYQMDEGNQCVDVDECATDSHQCNPTQICINTEGGYTCSCTDGYWLLEGQCLDIDECR 60

Qy 174 YGYCQQLCANVPGSYSCTCNPGFTLNEDGRSCQDVNECATENPCVQTCVNTYGSFICRCD 233
       ||||||||||||||||||||||||||:||||||||||| |||||||||||||||||||||
Db  61 YGYCQQLCANVPGSYSCTCNPGFTLNDDGRSCQDVNECETENPCVQTCVNTYGSFICRCD 120

Qy 234 PGYELEEDGVHCSDMDECSFSEFLCQHECVNQPGTYFCSCPPGYILLDDNRSCQDINECE 293
       |||||||||:||||||||||||||||||||||||:|||||||||:||:||||||||||||
Db 121 PGYELEEDGIHCSDMDECSFSEFLCQHECVNQPGSYFCSCPPGYVLLEDNRSCQDINECE 180

Qy 294 HRNHTCNLQQTCYNLQGGFKCIDPIRCEEPYLRISDNRCMCPAENPGCRDQPFTILYRDM 353
       ||||||   |||||||||||||||| |||||| | |||||||||| ||||||||||:|||
Db 181 HRNHTCTPLQTCYNLQGGFKCIDPIVCEEPYLLIGDNRCMCPAENTGCRDQPFTILFRDM 240

Qy 354 DVVSGRSVPADIFQMQATTRYPGAYYIFQIKSGNEGREFYMRQTGPISATLVMTRPIKGP 413
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 241 DVVSGRSVPADIFQMQATTRYPGAYYIFQIKSGNEGREFYMRQTGPISATLVMTRPIKGP 300

Qy 414 REIQLDLEMITVNTVINFRGSSVIRLRIYVSQYPF 448
       |:|||||||||||||||||||||||||||||||||
Db 301 RDIQLDLEMITVNTVINFRGSSVIRLRIYVSQYPF 335


RESULT 4 
US-09-188-930-186





Sequence 186, Application US/09188930A
Patent No. 6150502 



GENERAL INFORMATION: 

APPLICANT: Watson, James D. 
APPLICANT: Strachan, Lorna 
APPLICANT: Sleeman, Matthew 
APPLICANT: Onrust, Rene
APPLICANT: Murison, James Greg 

TITLE OF INVENTION: Compositions Isolated From Skin Cells 
TITLE OF INVENTION: and Methods For Their Use 
FILE REFERENCE: 11000.1011c1

CURRENT APPLICATION NUMBER: US/09/188,930A
CURRENT FILING DATE: 1998-11-09
NUMBER OF SEQ ID NOS: 348
SOFTWARE: FastSEQ for Windows Version 3.0
SEQ ID NO 186 

LENGTH: 337 

TYPE : PRT 

ORGANISM: Rat 

FEATURE:
NAME/KEY: UNSURE
LOCATION: (14)..(124)
NAME/KEY: UNSURE
LOCATION: (135)..(135)
NAME/KEY: UNSURE
LOCATION: (141)..(141)
NAME/KEY: UNSURE
LOCATION: (166)..(166)
NAME/KEY: UNSURE
LOCATION: (167)..(167)
NAME/KEY: UNSURE
LOCATION: (183)..(183)
US-09-188-930-186

Query Match 70.7%; Score 1791; DB 3; Length 337;
Best Local Similarity 93.7%; Pred. No. 1.5e-134;
Matches 314; Conservative 6; Mismatches 15; Indels 0; Gaps 0;



Qy  114 RFGYQMDESNQCVDVDECATDSHQCNPTQICINTEGGYTCSCTDGYWLLEGQCLDIDECR 173
        |||||||| |||||||||||||||||||||||||||||||||||||||||||||||||||
Db    3 RFGYQMDEGNQCVDVDECATDSHQCNPTQICINTEGGYTCSCTDGYWLLEGQCLDIDECR 62

Qy  174 YGYCQQLCANVPGSYSCTCNPGFTLNEDGRSCQDVNECATENPCVQTCVNTYGSFICRCD 233
        ||||||||||||||||||||||||||:||||||||||| |||||||||||||||||||||
Db   63 YGYCQQLCANVPGSYSCTCNPGFTLNDDGRSCQDVNECETENPCVQTCVNTYGSFICRCD 122

Qy  234
Db  123

Qy  294
Db  183

Qy  354
Db  243

Qy  414
Db  303



RESULT 5 

US-08-833-963C-2 

Sequence 2, Application US/08833963C
Patent No. 5916769 
GENERAL INFORMATION: 

APPLICANT: Olsen, et al . 

TITLE OF INVENTION: Extracellular/Epidermal Growth Factor 
TITLE OF INVENTION: HCABA58X 
NUMBER OF SEQUENCES : 9 
CORRESPONDENCE ADDRESS : 

ADDRESSEE: Human Genome Sciences, Inc. 

STREET: 9410 Key West Ave 

CITY: Rockville 

STATE : MD 

COUNTRY : USA 



ZIP: 20850
COMPUTER READABLE FORM:
MEDIUM TYPE: Floppy disk
; COMPUTER: IBM PC compatible

OPERATING SYSTEM: PC-DOS/MS-DOS
SOFTWARE: PatentIn Release #1.0, Version #1.30
CURRENT APPLICATION DATA:
; APPLICATION NUMBER: US/08/833,963C

FILING DATE: 11-APR-1997
CLASSIFICATION: 435 
PRIOR APPLICATION DATA: 

APPLICATION NUMBER: WO PCT/US96/05033 
FILING DATE: 10-APR-1996 
ATTORNEY/AGENT INFORMATION:
; NAME: Brookes, A. Anders 

REGISTRATION NUMBER: 36,373 
; REFERENCE/DOCKET NUMBER: PF258

TELECOMMUNICATION INFORMATION: 
TELEPHONE: 301-309-8504 
TELEFAX: 301-309-8439 
; INFORMATION FOR SEQ ID NO: 2: 
SEQUENCE CHARACTERISTICS: 
LENGTH: 443 amino acids 
; TYPE: amino acid 

TOPOLOGY: linear 
MOLECULE TYPE: protein 
US-08-833-963C-2 

Query Match 50.9%; Score 1289; DB 2; Length 443; 

Best Local Similarity 49.9%; Pred. No. 1.3e-94; 

Matches 226; Conservative 74; Mismatches 131; Indels 22; Gaps 5; 

Qy 1 MPGI KRI LTVTI LALCLPSPGNAQ AQCTNGFDLDRQSGQCLDI DECRTI PEACRG 55 

••II ■- '-I I II •••••• -.||:|- I I I I--II I I I I I I >• I 

Db 8 LPGSLLLWALLLLLLGSASPQDSEEPDSYTECTDGYEWDPDSQHCRDVNECLTIPEACKG 67 

Qy 56 DMMCVNQNGGYLCI PRTNP VYRGPYSNPYSTPYSGPYPAAAPPLSAPNYPTI SRPLI CRF 115 

• I hi llllhlh I : II I II II I 

Db 68 EMKCINHYGGYLCLPRSAAVINDLHG EGP-PPPVPPAQHPN--- PCPP 111 

Qy 116 GYQMDESNQCVDVDECATDSHQCNPTQICINTEGGYTCSCTDGYWLLEGQCLDIDECRYG 175 

ii: i= Illlllll I I hi II I I hi Ml = chimin 

Db 112 GYEPDDQDSCVDVDECAQALHDCRPSQDCHNLPGSYQCTCPDGYRKIGPECVDIDECRYR 171 

Qy 176 YCQQLCANVPGSYSCTCNPGFTLNEDGRSCQDVNECATENPCVQTCVNTYGSFICRCDPG 235 

III I hilh I I 111 I : III Mill II I I hlhhlll I 
Db 172 YCQHRCVNLPGSFRCQCEPGFQLGPNNRSCVDVNECDMGAPCEQRCFNSYGTFLCRCHQG 231 

Qy 236 YELEEDGVHCSDMDECSFS EFLCQHE CVNQPGTYFCS CP PG YI LLDDNRS CQD I NE CEHR 295 

III II llhlllhl :)lh Mhll : 1 II II II I lllhlll 
Db 232 YELHRDGFSCSDIDECSYSSYLCQYRCVNEPGRFSCHCPQGYQLL-ATRLCQDIDECESG 290 

Qy 296 NHTCNLQQTCYNLQGGFKCI DP I RCEEPYLRI SDNRCMCPAENPGCRDQPFTILYRDMDV 355 

I 1= III I ll-hl II I 11 = :: hi I 1 = 111 II Ihll :h = | I = 
Db 291 AHQCSEAQTCVNFHGGYRCVDTNRCVEPYIQVSENRCLCPASNPLCREQPSSIVHRYMTI 350

Qy 356 VSGRS VPADI FQMQATTRYPGAYYI FQIKSGNEGREFYMRQTGP I SATLVMTRP I KGPRE 415 



I IIIIIMMIh Mill |||::|| :|[:[| :|| 11= lh Mil 
Db 351 TSERSVPADVFQIQATSVYPGAYNAFQIRAGNSQGDFYIRQINNVSAMLVLARPVTGPRE 410

Qy 416 IQLDLEMITVNTVINFRGSSVIRLRIYVSQYPF 448 

I I I I I I : I : - - I llhll -I I I 
Db 411 YVLDLEMVTMNSLMS YRAS SVLRLTVFVGAYTF 443 



RESULT 6 
US-08-980-514-1 

; Sequence 1, Application US/08980514 
; Patent No. 6004753 

; GENERAL INFORMATION: 

APPLICANT: Yue, Henry 
APPLICANT: Guegler, Karl J. 
APPLICANT: Shah, Purvi 

TITLE OF INVENTION: HUMAN S1-5-ECMP-LIKE PROT 
TITLE OF INVENTION: EIN 
NUMBER OF SEQUENCES: 3 
CORRESPONDENCE ADDRESS: 
; ADDRESSEE: Incyte Pharmaceuticals, Inc. 

STREET: 3174 Porter Dr. 

CITY: Palo Alto

STATE: CA 

COUNTRY: USA 

ZIP: 94304 
COMPUTER READABLE FORM: 

MEDIUM TYPE: Diskette 

COMPUTER: IBM Compatible

OPERATING SYSTEM: DOS 

SOFTWARE: FastSEQ for Windows Version 2.0 
CURRENT APPLICATION DATA: 

APPLICATION NUMBER: US/08/980,514

FILING DATE: Filed Herewith 
; PRIOR APPLICATION DATA: 

APPLICATION NUMBER: 

FILING DATE: 
; ATTORNEY /AGENT INFORMATION: 

NAME: Billings, Lucy J. 

REGISTRATION NUMBER: 36,749 
; REFERENCE/DOCKET NUMBER: PF-0436 US 

TELECOMMUNICATION INFORMATION: 

TELEPHONE: 650-855-0555 

TELEFAX: 650-845-4166 
; INFORMATION FOR SEQ ID NO: 1: 

SEQUENCE CHARACTERISTICS: 
; LENGTH: 443 amino acids 

TYPE: amino acid 

STRANDEDNESS : single 

TOPOLOGY: linear 
IMMEDIATE SOURCE: 

LIBRARY: BRSTNOT13 

CLONE: 2786449
US-08-980-514-1 

Query Match 50.9%; Score 1289; DB 3; Length 443; 

Best Local Similarity 49.9%; Pred. No. 1.3e-94; 



Matches 226; Conservative 74; Mismatches 131; Indels 22; Gaps 5; 

IPG I KRILTVT I LALCLPS PGNAQ AQCTNGFDLDRQSGQCLD IDECRTI PEACRG 55 

•\\ : : =| I II : I I = I • I I I h = ll lllllhl 



= 1 hi llllhlh I II I II II | 

EMKC I NHYGGYLCLPRSAAVI NDLHG EGP-PPPVPPAQHPN PCPP 111 



11= h : llllllil I I hi I I I I hi III = :hl!!l!ll 
GYEPDDQDSCVDVDECAQALHDCRPSQDCHNLPGSYQCTCPDGYRKIGPECVDIDECRYF 

YCQQLCANVPGSYSCTCNPGFTLNEDGRSCQDVNECATENPCVQTCVNTYGSFICRCDPG 

III I hllh I I Ml I III II I II II I I I - 1 1 - h I i I I 

YCQHRCVNLPGSFRCQCEPGFQLGPNNRSCVDVNECDMGAPCEQRCFNSYGTFLCRCHQG 

YELEEDGVHCSDMDE CS FS EFLCQHE CVNQPGTYFCS CP PG Y I LLDDNRS CQDI NECEHE 

III II llhlllhl ■ llhll : I II II II I nihil! 
YELHRDGFS CSD I DE CS YS S YLCQYRCVNE PGRFS CHCPQG YQLL - ATRLCQDI DECESC 

NHTCNLQQTCYNLQGGFKCIDPIRCEEPYLRISDNRCMCPAEMPGCRDQPFTILYRDMD\ 

I h III I lh = hl II Mh-hllhlll II Ihll =1-1 I : 
AHQCSEAQTCVNFHGGYRCVDTNRCVEPYIQVSENRCLCPASNPLCREQPSSIVHRYMTI 

VSGRS VP AD I FQMQATTRY PGAYY I FQI KSGNEGREFYMRQTG P I SATLVMTRP I KGPRE 

I lllllhlhllh Mill , , i ii lh III! 

TSERSVPADVFQI QATSVYPGAYNAFQI RAGNSQGDFY I RQI NNVSAMLVLARPVTGPRI; 

IQLDLEMITVNTVINFRGSSVIRLRI YVSQYPF 448 
llllhhh::::| llhll = = l I I 



Qy 


1 


Db 


8 


Qy 


56 


Db 


68 


Qy 


116 


Db 


112 


Qy 


176 


Db 


172 


Qy 


236 


Db 


232 


Qy 


296 


Db 


291 


Qy 


356 


Db 


351 


Qy 


416 


Db 


411 



RESULT 7 
US-08-884-072-5 

; Sequence 5, Application US/08884072 
; Patent No. 5872234 
; GENERAL INFORMATION: 
; APPLICANT: Bandman, Olga 
; APPLICANT: Corley, Neil C. 
APPLICANT: Guegler, Karl J. 

TITLE OF INVENTION: HUMAN EXTRACELLULAR MATRIX PROTEINS 
NUMBER OF SEQUENCES: 6 
CORRESPONDENCE ADDRESS: 

ADDRESSEE: Incyte Pharmaceuticals, Inc. 

STREET: 3174 Porter Drive 
; CITY: Palo Alto

STATE: CA
; COUNTRY: USA

ZIP: 94304
COMPUTER READABLE FORM: 

MEDIUM TYPE: Diskette 

COMPUTER: IBM Compatible 

OPERATING SYSTEM : DOS 

SOFTWARE: FastSEQ for Windows Version 2.0 



CURRENT APPLICATION DATA: 

APPLICATION NUMBER: US/08/884,072 
FILING DATE: Herewith 
CLASSIFICATION: 424 
PRIOR APPLICATION DATA: 
APPLICATION NUMBER: 
FILING DATE: 
ATTORNEY/ AGENT INFORMATION: 
NAME: Billings, Lucy J. 
REGISTRATION NUMBER : 36,749 
REFERENCE/DOCKET NUMBER: PF-0333 US 
TELECOMMUNICATION INFORMATION: 
TELEPHONE: 415-855-0555 
TELEFAX: 415-845-4166 
TELEX: 

INFORMATION FOR SEQ ID NO: 5: 
SEQUENCE CHARACTERISTICS: 
LENGTH: 387 amino acids 
TYPE: amino acid 
STRANDEDNESS: single
TOPOLOGY: linear 
IMMEDIATE SOURCE: 
LIBRARY : GenBank 
CLONE: 458228 
US-08-884-072-5 

Query Match 38.0%; Score 963.5; DB 2; Length 387; 

Best Local Similarity 48.3%; Pred. No. 7.3e-69; 

Matches 171; Conservative 55; Mismatches 123; Indels 5; Gaps 4;

Qy   96 APPLSAPNYPTISRPLICRFGYQMDESNQCVDVDECATDSHQCNPTQICINTEGGYTCSC 155
        | |   |: |  |  : |  ||:  | | | |:|||   :| |   |:|||  | : | |
Db   38 ADPQRIPSNP--SHRIQCAAGYEQSEHNVCQDIDECTAGTHNCRADQVCINLRGSFACQC 95

Qy  156 TDGYWLLEGQCLDIDECRY-GYCQQLCANVPGSYSCTCNPGFTLNEDGRSCQDVNECATE 214
          ||     ||:|||||    || | | | |||: | |:||| |  :  :| |:|||
Db   96 PPGYQKRGEQCVDIDECTIPPYCHQRCVNTPGSFYCQCSPGFQLAANNYTCVDINECDAS 155

Qy  215 NPCVQTCVNTYGSFICRCDPGYELEEDGVHCSDMDECSFSEFLCQHECVNQPGTYFCSCP 274
        | | | | |  |||||:|: ||||  | ::| |:|||  | :|||::|||:|| : | ||
Db  156 NQCAQQCYNILGSFICQCNQGYELSSDRLNCEDIDECRTSSYLCQYQCVNEPGKFSCMCP 215

Qy  275 PGYILLDDNRSCQDINECEHRNHTCNLQQTCYNLQGGFKCIDPIRCEEPYLRISDNRCMC 334
         || ::  :|:||||||||  |  |   : |:|  |||:|     |::||:   :|||:|
Db  216 QGYQVV-RSRTCQDINECETTNE-CREDEMCWNYHGGFRCYPRNPCQDPYILTPENRCVC 273

Qy  335 PAENPGCRDQPFTILYRDMDVVSGRSVPADIFQMQATTRYPGAYYIFQIKSGNEGREFYM 394
        |  |  ||: | :|:|: | : | ||||:||||:|||| |      |:||||||  |||:
Db  274 PVSNAMCRELPQSIVYKYMSIRSDRSVPSDIFQIQATTIYANTINTFRIKSGNENGEFYL 333

Qy  395 RQTGPISATLVMTRPIKGPREIQLDLEMITVNTVINFRGSSVIRLRIYVSQYPF 448
        ||| |:|| ||: : : ||||  :||||:||:::  || |||:|| | |  : |
Db  334 RQTSPVSAMLVLVKSLSGPREHIVDLEMLTVSSIGTFRTSSVLRLTIIVGPFSF 387



RESULT 8 

US-08-833-963C-9 



; Sequence 9, Application US/08833963C
; Patent No. 5916769
; GENERAL INFORMATION: 

APPLICANT: Olsen, et al . 
; TITLE OF INVENTION: Extracellular/Epidermal Growth Factor
TITLE OF INVENTION: HCABA58X 
NUMBER OF SEQUENCES : 9 
CORRESPONDENCE ADDRESS : 
; ADDRESSEE: Human Genome Sciences, Inc. 

STREET : 9410 Key West Ave 
; CITY: Rockville 

STATE : MD 
COUNTRY: USA 
ZIP: 20850 
COMPUTER READABLE FORM: 
; MEDIUM TYPE : Floppy disk 

COMPUTER: IBM PC compatible 
OPERATING SYSTEM: PC-DOS/MS-DOS 
SOFTWARE: PatentIn Release #1.0, Version #1.30
CURRENT APPLICATION DATA:

APPLICATION NUMBER: US/08/833,963C
FILING DATE: 11-APR-1997
; CLASSIFICATION: 435 

PRIOR APPLICATION DATA: 
; APPLICATION NUMBER: WO PCT/US96/05033

FILING DATE: 10-APR-1996 
; ATTORNEY/ AGENT INFORMATION: 
; NAME: Brookes, A. Anders 

REGISTRATION NUMBER: 36,373 
REFERENCE/DOCKET NUMBER: PF258 
TELECOMMUNICATION INFORMATION: 
TELEPHONE: 301-309-8504 
TELEFAX: 301-309-8439 
; INFORMATION FOR SEQ ID NO: 9: 

SEQUENCE CHARACTERISTICS: 
; LENGTH: 387 amino acids 

; TYPE: amino acid 

; STRANDEDNESS : Single 

TOPOLOGY: linear 
MOLECULE TYPE: protein 
US-08-833-963C-9 

Query Match 38.0%; Score 963.5; DB 2; Length 387; 

Best Local Similarity 48.3%; Pred. No. 7.3e-69; 

Matches 171; Conservative 55; Mismatches 123; Indels 5; Gaps 4;

Qy   96 APPLSAPNYPTISRPLICRFGYQMDESNQCVDVDECATDSHQCNPTQICINTEGGYTCSC 155
        | |   |: |  |  : |  ||:  | | | |:|||   :| |   |:|||  | : | |
Db   38 ADPQRIPSNP--SHRIQCAAGYEQSEHNVCQDIDECTAGTHNCRADQVCINLRGSFACQC 95

Qy  156 TDGYWLLEGQCLDIDECRY-GYCQQLCANVPGSYSCTCNPGFTLNEDGRSCQDVNECATE 214
          ||     ||:|||||    || | | | |||: | |:||| |  :  :| |:|||
Db   96 PPGYQKRGEQCVDIDECTIPPYCHQRCVNTPGSFYCQCSPGFQLAANNYTCVDINECDAS 155

Qy  215 NPCVQTCVNTYGSFICRCDPGYELEEDGVHCSDMDECSFSEFLCQHECVNQPGTYFCSCP 274
        | | | | |  |||||:|: ||||  | ::| |:|||  | :|||::|||:|| : | ||
Db  156 NQCAQQCYNILGSFICQCNQGYELSSDRLNCEDIDECRTSSYLCQYQCVNEPGKFSCMCP 215

Qy  275 PGYILLDDNRSCQDINECEHRNHTCNLQQTCYNLQGGFKCIDPIRCEEPYLRISDNRCMC 334
         || ::  :|:||||||||  |  |   : |:|  |||:|     |::||:   :|||:|
Db  216 QGYQVV-RSRTCQDINECETTNE-CREDEMCWNYHGGFRCYPRNPCQDPYILTPENRCVC 273

Qy  335 PAENPGCRDQPFTILYRDMDVVSGRSVPADIFQMQATTRYPGAYYIFQIKSGNEGREFYM 394
        |  |  ||: | :|:|: | : | ||||:||||:|||| |      |:||||||  |||:
Db  274 PVSNAMCRELPQSIVYKYMSIRSDRSVPSDIFQIQATTIYANTINTFRIKSGNENGEFYL 333

Qy  395 RQTGPISATLVMTRPIKGPREIQLDLEMITVNTVINFRGSSVIRLRIYVSQYPF 448
        ||| |:|| ||: : : ||||  :||||:||:::  || |||:|| | |  : |
Db  334 RQTSPVSAMLVLVKSLSGPREHIVDLEMLTVSSIGTFRTSSVLRLTIIVGPFSF 387




RESULT 9 
US-08-980-514-3

; Sequence 3, Application US/08980514 
; Patent No. 6004753 
; GENERAL INFORMATION : 

APPLICANT: Yue, Henry 
APPLICANT: Guegler, Karl J. 
; APPLICANT: Shah, Purvi

TITLE OF INVENTION: HUMAN S1-5-ECMP-LIKE PROT 
TITLE OF INVENTION: EIN 
NUMBER OF SEQUENCES: 3 
CORRESPONDENCE ADDRESS: 
; ADDRESSEE: Incyte Pharmaceuticals, Inc. 

STREET: 3174 Porter Dr. 
CITY: Palo Alto 
STATE : CA 
COUNTRY : USA 
; ZIP: 94304 

COMPUTER READABLE FORM: 
MEDIUM TYPE: Diskette 
COMPUTER: IBM Compatible 
OPERATING SYSTEM: DOS 
; SOFTWARE: FastSEQ for Windows Version 2.0 

CURRENT APPLICATION DATA: 

APPLICATION NUMBER: US/08/980,514
FILING DATE: Filed Herewith 
PRIOR APPLICATION DATA: 
APPLICATION NUMBER: 
FILING DATE: 
ATTORNEY/ AGENT INFORMATION: 
; NAME: Billings, Lucy J. 

REGISTRATION NUMBER: 36,749 
REFERENCE/DOCKET NUMBER: PF-0436 US 
; TELECOMMUNICATION INFORMATION: 

TELEPHONE: 650-855-0555 
; TELEFAX: 650-845-4166

; INFORMATION FOR SEQ ID NO: 3: 

SEQUENCE CHARACTERISTICS: 
; LENGTH : 387 amino acids 

; TYPE: amino acid 

STRANDEDNESS : single 
TOPOLOGY: linear 
IMMEDIATE SOURCE: 



; LIBRARY: GenBank 

CLONE: 458228 
US-08-980-514-3 

Query Match 38.0%; Score 963.5; DB 3; Length 387;

Best Local Similarity 48.3%; Pred. No. 7.3e-69;

Matches 171; Conservative 55; Mismatches 123; Indels 5; Gaps 4; 

Qy   96 APPLSAPNYPTISRPLICRFGYQMDESNQCVDVDECATDSHQCNPTQICINTEGGYTCSC 155
        | |   |: |  |  : |  ||:  | | | |:|||   :| |   |:|||  | : | |
Db   38 ADPQRIPSNP--SHRIQCAAGYEQSEHNVCQDIDECTAGTHNCRADQVCINLRGSFACQC 95

Qy  156 TDGYWLLEGQCLDIDECRY-GYCQQLCANVPGSYSCTCNPGFTLNEDGRSCQDVNECATE 214
          ||     ||:|||||    || | | | |||: | |:||| |  :  :| |:|||
Db   96 PPGYQKRGEQCVDIDECTIPPYCHQRCVNTPGSFYCQCSPGFQLAANNYTCVDINECDAS 155

Qy  215 NPCVQTCVNTYGSFICRCDPGYELEEDGVHCSDMDECSFSEFLCQHECVNQPGTYFCSCP 274
        | | | | |  |||||:|: ||||  | ::| |:|||  | :|||::|||:|| : | ||
Db  156 NQCAQQCYNILGSFICQCNQGYELSSDRLNCEDIDECRTSSYLCQYQCVNEPGKFSCMCP 215

Qy  275 PGYILLDDNRSCQDINECEHRNHTCNLQQTCYNLQGGFKCIDPIRCEEPYLRISDNRCMC 334
         || ::  :|:||||||||  |  |   : |:|  |||:|     |::||:   :|||:|
Db  216 QGYQVV-RSRTCQDINECETTNE-CREDEMCWNYHGGFRCYPRNPCQDPYILTPENRCVC 273

Qy  335 PAENPGCRDQPFTILYRDMDVVSGRSVPADIFQMQATTRYPGAYYIFQIKSGNEGREFYM 394
        |  |  ||: | :|:|: | : | ||||:||||:|||| |      |:||||||  |||:
Db  274 PVSNAMCRELPQSIVYKYMSIRSDRSVPSDIFQIQATTIYANTINTFRIKSGNENGEFYL 333

Qy  395 RQTGPISATLVMTRPIKGPREIQLDLEMITVNTVINFRGSSVIRLRIYVSQYPF 448
        ||| |:|| ||: : : ||||  :||||:||:::  || |||:|| | |  : |
Db  334 RQTSPVSAMLVLVKSLSGPREHIVDLEMLTVSSIGTFRTSSVLRLTIIVGPFSF 387



RESULT 10 
US-09-212-168-5 

; Sequence 5, Application US/09212168

; Patent No. 6303765

; GENERAL INFORMATION: 

; APPLICANT: Bandman, Olga 

APPLICANT: Corley, Neil C. 
; APPLICANT: Guegler, Karl J. 

; TITLE OF INVENTION: HUMAN EXTRACELLULAR MATRIX PROTEINS 
NUMBER OF SEQUENCES: 6 
CORRESPONDENCE ADDRESS: 

ADDRESSEE: Incyte Pharmaceuticals, Inc. 
; STREET: 3174 Porter Drive 

CITY: Palo Alto 
STATE: CA 
COUNTRY: USA 
ZIP : 94304 
COMPUTER READABLE FORM: 
MEDIUM TYPE: Diskette 
COMPUTER: IBM Compatible 
; OPERATING SYSTEM : DOS 

; SOFTWARE: FastSEQ for Windows Version 2.0 

; CURRENT APPLICATION DATA: 

APPLICATION NUMBER: US/09/212,168 



FILING DATE : 

CLASSIFICATION: 
PRIOR APPLICATION DATA: 

APPLICATION NUMBER: 08/884,072 

FILING DATE: 
ATTORNEY/ AGENT INFORMATION: 
; NAME: Billings, Lucy J. 

REGISTRATION NUMBER : 36,749 

REFERENCE/DOCKET NUMBER: PF-0333 US 
TELECOMMUNICATION INFORMATION: 

TELEPHONE: 415-855-0555 

TELEFAX: 415-845-4166 

TELEX : 

; INFORMATION FOR SEQ ID NO: 5: 
SEQUENCE CHARACTERISTICS: 

LENGTH: 387 amino acids 
; TYPE: amino acid 

STRANDEDNESS : single 

TOPOLOGY: linear 
; IMMEDIATE SOURCE: 
; LIBRARY: GenBank 

; CLONE: 458228 

US-09-212-168-5 

Query Match 38.0%; Score 963.5; DB 4; Length 387; 

Best Local Similarity 48.3%; Pred. No. 7.3e-69;

Matches 171; Conservative 55; Mismatches 123; Indels 5; Gaps 4; 

Qy   96 APPLSAPNYPTISRPLICRFGYQMDESNQCVDVDECATDSHQCNPTQICINTEGGYTCSC 155
        | |   |: |  |  : |  ||:  | | | |:|||   :| |   |:|||  | : | |
Db   38 ADPQRIPSNP--SHRIQCAAGYEQSEHNVCQDIDECTAGTHNCRADQVCINLRGSFACQC 95

Qy  156 TDGYWLLEGQCLDIDECRY-GYCQQLCANVPGSYSCTCNPGFTLNEDGRSCQDVNECATE 214
          ||     ||:|||||    || | | | |||: | |:||| |  :  :| |:|||
Db   96 PPGYQKRGEQCVDIDECTIPPYCHQRCVNTPGSFYCQCSPGFQLAANNYTCVDINECDAS 155

Qy  215 NPCVQTCVNTYGSFICRCDPGYELEEDGVHCSDMDECSFSEFLCQHECVNQPGTYFCSCP 274
        | | | | |  |||||:|: ||||  | ::| |:|||  | :|||::|||:|| : | ||
Db  156 NQCAQQCYNILGSFICQCNQGYELSSDRLNCEDIDECRTSSYLCQYQCVNEPGKFSCMCP 215

Qy  275 PGYILLDDNRSCQDINECEHRNHTCNLQQTCYNLQGGFKCIDPIRCEEPYLRISDNRCMC 334
         || ::  :|:||||||||  |  |   : |:|  |||:|     |::||:   :|||:|
Db  216 QGYQVV-RSRTCQDINECETTNE-CREDEMCWNYHGGFRCYPRNPCQDPYILTPENRCVC 273

Qy  335 PAENPGCRDQPFTILYRDMDVVSGRSVPADIFQMQATTRYPGAYYIFQIKSGNEGREFYM 394
        |  |  ||: | :|:|: | : | ||||:||||:|||| |      |:||||||  |||:
Db  274 PVSNAMCRELPQSIVYKYMSIRSDRSVPSDIFQIQATTIYANTINTFRIKSGNENGEFYL 333

Qy  395 RQTGPISATLVMTRPIKGPREIQLDLEMITVNTVINFRGSSVIRLRIYVSQYPF 448
        ||| |:|| ||: : : ||||  :||||:||:::  || |||:|| | |  : |
Db  334 RQTSPVSAMLVLVKSLSGPREHIVDLEMLTVSSIGTFRTSSVLRLTIIVGPFSF 387



RESULT 11 
US-09-188-930-336 

; Sequence 336, Application US/09188930A 
; Patent No. 6150502 



; GENERAL INFORMATION: 

; APPLICANT: Watson, James D.

; APPLICANT: Strachan, Lorna 

; APPLICANT: Sleeman, Matthew 

; APPLICANT: Onrust, Rene 

; APPLICANT: Murison, James Greg 

; TITLE OF INVENTION : Compositions Isolated From Skin Cells 

; TITLE OF INVENTION: and Methods For Their Use 

; FILE REFERENCE: 11000.1011c1

; CURRENT APPLICATION NUMBER: US/09/188,930A

; CURRENT FILING DATE: 1998-11-09 

; NUMBER OF SEQ ID NOS : 348 

; SOFTWARE: FastSEQ for Windows Version 3.0 

; SEQ ID NO 336 

LENGTH: 274 

TYPE: PRT 
; ORGANISM: Human 
US-09-188-930-336 

Query Match 36.0%; Score 912.5; DB 3; Length 274; 

Best Local Similarity 56.0%; Pred. No. 5.5e-65; 

Matches 154; Conservative 47; Mismatches 73; Indels 1; Gaps 1; 

Qy 174 YGYCQQLCANVPGSYSCTCNPGFTLNEDGRSCQDVNECATENPCVQTCVNTYGSF I CRCD 233 

I III I hllh I I III I : III Mill II I I |:!l:|:|ll 

Db 1 YRYCQHRCVNLPGSFRCQCEPGFQLGPNNRSCVDVNECDMGAPCEQRCFNSYGTFLCRCH 60 

Qy 234 PGYELEEDG VHCSDMDECS FSE FLCQHECVNQPGT YFCS CPPGYI LLDDNRS CQDI NECE 293 

III! II llhlilhl :|lh 111=11 = Ml II II I illhlil 
Db 61 QGYELHRDGFSCSDIDECSYSSYLCQYRCVNEPGRFSCHCPQGYQLL-ATRLCQDIDECE 119 

Qy 2 94 HRNHTCNLQQTCYNLQGGFKCIDPIRCEEPYLRISDNRCMCPAENPGCRDQPFTILYRDM 353 

I 1 = Mi I I I = = I = I II llh-Mlblll II 11 = 11 =1-1 1 
Db 120 SGAHQCS EAQTCVNFHGGYRCVDTNRCVE P Y I QVS ENRCLCPASN PLCREQP S S I VHRYM 179 

Qy 354 DWSGRSVPADIFQMQATTRYPGAYYI FQI KSGNEGREFYMRQTGPI SATLVMTRPI KGP 413 

I 111111 = 11 = 111 = Mill ]ll = = ll =11 = 11 = 11 11 = 11= II 
Db 180 T I TSERS VPADVFQI OATS VYPGAYNAFQI RAGNSQGDFY I RQI NNVSAMLVLARPVTGP 239 

Qy 414 REI QLDLEMI TVNTVI NFRGSS VI RLRI YVSQYPF 448 

II IIIII=I=I==:=:| 111=11 ==i II 

Db 240 REYVLDLEMVTMNSLMSYRASSVLRLTVFVGAYTF 274 



RESULT 12 

US-09-312-283C-336 

Sequence 336, Application US/09312283C 
Patent No. 6573095 
GENERAL INFORMATION: 

APPLICANT: Watson, James D. 
APPLICANT: Strachan, Lorna
APPLICANT: Sleeman, Matthew 
APPLICANT: Onrust, Rene 
APPLICANT: Murison, James G.
APPLICANT: Kumble, Krishanand D. 

TITLE OF INVENTION: Compositions Isolated from Skin Cells 
TITLE OF INVENTION: and Methods for Their Use



; FILE REFERENCE: 11000.1011C2

; CURRENT APPLICATION NUMBER: US/09/312,283C

; CURRENT FILING DATE: 1999-05-14 

; NUMBER OF SEQ ID NOS : 425 

; SOFTWARE: FastSEQ for Windows Version 4.0 

; SEQ ID NO 336 

LENGTH: 274 

TYPE: PRT 

ORGANISM: Mouse 
US-09-312-283C-336 

Query Match 36.0%; Score 912.5; DB 4; Length 274; 

Best Local Similarity 56.0%; Pred. No. 5.5e-65;

Matches 154; Conservative 47; Mismatches 73; Indels 1; Gaps 1;

Qy 174 YGyCQQLCANVPGSYSCTCNPGFTLNEDGRSCQDVNECATENPCVQTCVNTYGSFICRCD 233 

I III I MM: I I III I : III Mill II I I ..,1,1 

Db 1 YRYCQHRCVNLPGSFRCQCEPGFQLGPNNRSCVDVNECDMGAPCEQRCFNSYGTFLCRCH 60 

Qy 234 PGYELEEDGVHCSDMDECSFSEFLCQHECVNQPGTYFCSCPPGYILLDDNRSCQDIHECE 293 

llll II llhlllhl *llh 1 I I : I I • I II II II I lllhlll 
Db 61 QGYELHRDGFSCSDI DECS YSS YLCQYRCWEPGRFSCHCPQGYQLL -ATRLCQDI DECE 119 

Qy 294 HRNHTCNLQQTCYNLQGGFKCIDPIRCEEPYLRISDNRCMCPAENPGCRDQPFTILYRDM 353 

I |: III I ||::hl II I I I : ^ I H I I = I I I II IMI =| = = l I 
Db 120 SGAHQCSEAQTCVNFHGGYRCVDTNRCVEPYIQVSENRCLCPASNPLCREQPSSIVHRYM 179 

Qy 354 DWSGRSVPADI FQMQATTRYPGAYYIFQI KSGNEGREFYMRQTGPI SATLVMTRPIKGP 413 

•• i iinihihiin inn NI-.--II nnn ni in in n 

Db 18 0 TI TSERS VPADVFQI QATS VY PGAYNAFQI RAGNS QGDF Y I RQ I NNVSAMLVLARPVTGP 239 

Qy 414 REI QLDLEMI TVNTVINFRGSS VI RLRI Y VSQ YPF 448 

II I I I I h I : I I llhll ==l I I 

Db 240 REYVLDLEMVTMNSLMSYRASSVLRLTVFVGAYTF 274 



RESULT 13 

5177197-30 

; Patent No. 5177197 

APPLICANT: KANZAKI, TETSUTO; OLOFSSON, ANDERS; MOREN, ANITA;
; WERNSTEDT, CHRISTER; HELLMAN, ULF; MIYAZONO, KOHEI; CLAESSON-WELSH,
; LENA; HELDIN, CARL-HENRIK

TITLE OF INVENTION: ISOLATED NUCLEOTIDE SEQUENCE EXPRESSING
; HUMAN TRANSFORMING GROWTH FACTOR-BETA1-BINDING PROTEIN
; NUMBER OF SEQUENCES: 53 
; CURRENT APPLICATION DATA: 
; APPLICATION NUMBER: US/07/487,343 

FILING DATE: 27-FEB-1990 
;SEQ ID NO:30; 

LENGTH: 1394 
5177197-30 

Query Match 21.5%; Score 544; DB 6; Length 1394; 

Best Local Similarity 29.4%; Pred. No. 5.5e-35; 

Matches 128; Conservative 52; Mismatches 134; Indels 122; Gaps 17; 



Qy 16 CLPSPGNAQAQCTNGFDLDRQSGQCLDI DECRTI PEACRGDMMCVNQNGGYLCI PRTNPV 75 



hi I h I •■hill II : I II l-il! 

Db 562 CINLPVRYTCICYEGYRFSEQQRKCVDIDECTQVQHLC-SQGRCENTEGSFLCI 614 

Qy 76 YRGPYSNPYSTPYSGPYPAAAPPLSAPNYPTISRPLI CRFGYQ 118 

h : : II : I Ih 

Db 615 CPAGFMASEEGTNCIDVDECLRPDVCGEGHCVNTVGAFRCEYCDSGYR 662 

Qy 119 MDESNQCVDVDECATDSHQCNPTQICINTEGGYTC-SCTDGYWLLEGQCLDIDEC-RYGY 176 

I = :| hill Il|> hh I I 1 Ihh llllhlll 
Db 663 MTQRGRCED1DECLNPS-TC-PDEQCVWSPGSYQCVPCTEGFRGWNGQCLDVDECLEPNV 720 

Qy 177 CQQ-LCANVPGSYSCTCNPGFTLNEDGRSCQDVNECATENPCVQ 219 

! hh III hh hi I = hl-ll I II 

Db 721 CANGDCSNLEGSYMCSCHKGYTRTPDHKHCRDIDECQQGNLCVNGQCKNTEGSFRCTCGQ 78 0 

Qy 220 TCVNTYGSFI CRCDPGYELEEDGVHCSDMDECSF 253 

I II III I II II I II h=|l 

Db 781 GYQLSAAKDQCEDIDECQHRHLCAHGQCRNTEGSFQCVCDQGYRASGLGDHCEDINECLE 84 0 

Qy 254 S EFLCQH - ECVNQPGTYFCS CPPGY I LLDDNRS CQDI NECEHRNHTCNLQQTCYNLQGGF 312 

'■ 01 -'hi hhhll h illl-lllllllll I I I hi I 

Db 841 DKS VCQRGDCINTAGS YDCTCPDGF - QLDDNKTCQDINECEHPG - LCGPQGECLNTEGSF 898 

Qy 313 KCI DPIRCEEPYLRIS DN RCMC PAENPGCR 342 

h I Ih - II Ihl 1 : II 

Db 399 HCVCQQGFSISADGRTCEDIDECVNNTVCDSHGFCDNTAGSFRCLCYQGFQAPQDGQGCV 958 

Qy 343 DQPFTI LYRDMDWSG 358 

I : =-|l 

Db 959 D VNECELLSG 968 



RESULT 14 
US-09-482-273-159 

; Sequence 159, Application US/09482273 

; Patent No. 6534631 

; GENERAL INFORMATION: 

; APPLICANT: Rosen et al . 

; TITLE OF INVENTION: 71 Human Secreted Proteins 
; FILE REFERENCE: PZ030P1 

; CURRENT APPLICATION NUMBER: US/09/482 , 273 

; CURRENT FILING DATE: 2000-01-13 

; EARLIER APPLICATION NUMBER: PCT/US99/15849 

; EARLIER FILING DATE: 1999-07-14 

; EARLIER APPLICATION NUMBER: 60/092,921 

; EARLIER FILING DATE: 1998-07-15 

; EARLIER APPLICATION NUMBER: 60/092,922 

EARLIER FILING DATE: 1998-07-15 
; EARLIER APPLICATION NUMBER: 60/092,956 
; EARLIER FILING DATE: 1998-07-15 
; NUMBER OF SEQ ID NOS : 267 
; SOFTWARE: PatentIn Ver. 2.0
; SEQ ID NO 159 

LENGTH: 575 

TYPE: PRT 

ORGANISM: Homo sapiens 
US-09-482-273-159 



Query Match 20.6%; Score 521.5; DB 4; Length 575; 

Best Local Similarity 29.4%; Pred. No. 1.2e-33;

Matches 144; Conservative 58; Mismatches 164; Indels 123; Gaps 21;

Qy 13 LALCLPSPGNAQAQCTNGFDLDRQSGQCLDIDECRTIPEACRGDMMCVNQNGGYLCIPRT 72 

II I = I : I t I = 1 ( I |:|:||| II III I I hi 

Db 157 LATALQAEEN - EVGCPEGFELDSQGAFCVDVDECAWDAHLCREGQRCVNLLGS YRCLPDC 215 

Qy 73 NPVYRGPYSNPYSTPYSGPYPAAAPPLSAPNYPTI SRPLI CRFGYQMDESNQCVDVDECA 132 

I I:*: : I Mill 

Db 216 GP GFRVADGAGCEDVDECL 234 

Qy 133 TDSHQCNPTQICINTEGGYTCSCTDGYWLLEG Q CLDI DECRY- -GYCQQLCANVPGS 18 7 

h M II II: III II -I llh:|| I I h II 

Db 235 EGLDDCHYNQLCENT PGGHRCSC PRGY - RMQGP SLPCLDVNECLQLPKACAYQCHNLQGS 293 

Qy 188 YSCTCNPGFTLNEDGRSC QDVNE CATENPCV QTCVNTYGSFI 22 9 

I I Ml II lh:| hi : I = :| 

Db 294 YRCLCPPGQTLLRDGKACTSLERNGQNVTTVSHRGPLLPWLRPWAS I PGTSYHAWVSLRP 353 

Qy 230 CRCDPG YELEEDGVHCSDMDECS F S EFLCQHECVNQPGTYF CS CP PGY I L 279 

I Ih : -II hhlll MM II hi I II II I 

Db 354 GPMALSSVGRAWCPPGF-IRQMGV-CTDLDECRVRM-LCQHACRNTEGSYQCLCPAGYRL 410 

Qy 280 LDDNRSCQDINECEHRNHTCNLQQTCYNLQGGFKCIDPIRCEEPYLRISDNRCMCPA 336 

I --11111111 I I hi • I ••••h-l III 
Db 411 LPSGKNCQDINECEEESIECGPGQMCFNTRGSYQCVD TPCPATYR 455 

Qy 337 --ENPG CRDQ PFTILYRDMDWSGRS VPADI FQMQATTR- -YPGAYYI FQ 382 

■•\\ I I h II = : I h - I I 

Db 456 QGP S PGTCFRRCSQDCGTGGP STLQYRLLPLPLGVRAHHDVARLTAFSBVGVPANRTELS 515 

Qy 383 IKSGNEGREFYMRQTGPISATL- -VMTRPIKGPREI -QLDLEMITVNTVTNFRGSSVIRL 439 

= = 10 h I I I II | : : | :|| | || | 

Db 516 MLEPDPRS PFALR PLRAGLGAVYTR RALTRAGLYRLTVRAAAP - RHQS VFVL 566 

Qy 440 RIYVSQYPF 448 

I II Ih 
Db 567 LIAVSPYPY 575 



RESULT 15 
US-09-482-273-245 

; Sequence 245, Application US/09482273 

; Patent No. 6534631 

; GENERAL INFORMATION: 

; APPLICANT: Rosen et al. 

; TITLE OF INVENTION: 71 Human Secreted Proteins 

; FILE REFERENCE : PZ030P1 

; CURRENT APPLICATION NUMBER: US/09/482,273 

; CURRENT FILING DATE: 2000-01-13 

; EARLIER APPLICATION NUMBER: PCT/US99/15849 

; EARLIER FILING DATE: 1999-07-14 

; EARLIER APPLICATION NUMBER: 60/092,921 

; EARLIER FILING DATE: 1998-07-15 

; EARLIER APPLICATION NUMBER: 60/092,922 



EARLIER FILING DATE: 1998-07-15 
EARLIER APPLICATION NUMBER: 60/092,956 
EARLIER FILING DATE: 1998-07-15 
NUMBER OF SEQ ID NOS: 267 
SOFTWARE: PatentIn Ver. 2.0
SEQ ID NO 245 
LENGTH: 638 
TYPE: PRT 

ORGANISM: Homo sapiens 
US-09-482-273-245 

Query Match 20.6%; Score 521.5; DB 4; Length 638; 

Best Local Similarity 29.4%; Pred. No. 1.4e-33;

Matches 144; Conservative 58; Mismatches 164; Indels 123; Gaps 21; 

Qy 13 LALCLPSPGNAQAQCTNGFDLDRQSGQCLDI DECRTI PEACRGDMMCVNQNGGYLCI PRT 72 

It I : I : I !MI I 1=1=111 It ill I ! hi 

Db 220 LATALQAEEN'EVGCPEGFELDSQGAFCVX5VDECAWDAHLCREGQRCVNLLGSYRCLPDC 278 

Qy 73 NPVYRGPYSNPYSTPYSGPYPAAAPPLSAPNYPTISRPLICRFGYQMDESNQCVDVDECA 132 

I l==: = I Hill 

Db 279 GP GFRVADGAGCEDVDECL 297 

Qy 133 TDSHQCNPTQICINTEGGYTCSCTDGYWLLEG— -QCLDIDECRY- -GYCQQLCANVPGS 187 

h hi II Ih III II ::| IIh = II I | h || 

Db 298 EGLDDCHYNQLCENTPGGHRCSCPRGY-RMQGPSLPCLDVNECLQLPKACAYQCHNLQGS 356 

Qy 188 YSCTCNPGFTLNEDGRSC QDVNECATENPCV QTCVNTYGSFI 229 

I I Ml II lh:| hi : M = M = : 

Db 357 YRCLCPPGQTLLRDGKACTSLERNGQNVTTVSHRGPLLPWLRPWASIPGTSYHAWSLRP 416 

Qy 230 CRCDPGYELEEDGVHCSDMDEC5FSEFLCQHECVNQPGTYFCSCPPGYIL 279 

I 11= ■• = = ll hhlll (lit (I hi I li II I 

Db 417 GPMALSSVGRAWCPPGF-IRQNGV-CTDLDECRWN-LCQHACRNTEGSYQCLCPAGYRL 473 

Qy 280 LDDNRSCQDINECEHRNHTCNLQQTCYNLQGGFKCI DPIRCEEPYLRI SDNRCMCPA 336 

I Mlllllll = 1 I hi =] =0 = 1 111 
Db 474 LPSGKNCQDINECEEESIECGPGQMCFNTRGSYQCVD TPCPATYR 518 

Qy 337 --ENPG CRDQ P FT I LYRDMD WSGRS VPAD I FQMQATTR - - YPGAY Y I FQ 382 

=11 i I h II : = I 1= == I = I 

Db 519 QGPS PGTCFRRCSQDCGTGGP STLQYRLLPLPLGVRAHHDVARLTAFS EVGVPANRTELS 578 

Qy 383 I KSGNEGREFYMRQTGPI SATL- -VMTRPI KGPREI -QLDLEMITVNTVTNFRGSS VTRL 439 

I =1 h I I Ml M = I =11 I II I 

Db 579 MLEPDPRSPFALR PLRAGLGA VYTR RALTRAGLYRLTVRAAAP - RHQS VFVL 629 

Qy 440 RIYVSQYPF 448 

I II Ih 
Db 630 LIAVSPYPY 638 



Search completed: January 9, 2004, 12:35:35 
Job time : 17.9449 secs
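
The summary figures printed with each result in this report appear to be related in a simple way. Query Match agrees with Score divided by the Perfect score (2533 for this query), and Best Local Similarity agrees with Matches divided by the total number of aligned columns (Matches + Conservative + Mismatches + Indels). These relations are inferred from the numbers in the report itself, not taken from GenCore documentation, so the following Python is only a sketch of a consistency check; the function names are illustrative.

```python
def query_match_percent(score: float, perfect_score: float) -> float:
    """Score as a percentage of the query's perfect (self-alignment) score."""
    return 100.0 * score / perfect_score


def best_local_similarity_percent(matches: int, conservative: int,
                                  mismatches: int, indels: int) -> float:
    """Identical columns as a percentage of all aligned columns.

    Assumption: the denominator is the sum of the four column counts
    printed on the "Matches ...; Conservative ...;" line.
    """
    aligned_columns = matches + conservative + mismatches + indels
    return 100.0 * matches / aligned_columns


if __name__ == "__main__":
    # Values copied from RESULT 3 (Score 1827, Perfect score 2533).
    print(f"Query Match: {query_match_percent(1827, 2533):.1f}%")
    print(f"Best Local Similarity: "
          f"{best_local_similarity_percent(319, 7, 9, 0):.1f}%")
```

For RESULT 3 this gives 72.1% and 95.2%, the values printed in the report. A result whose printed figures fail the check is a likely transcription error in the scanned text.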



GenCore version 5.1.6 
OM protein - protein search, using sw model



Run on: January 9, 2004, 12:34:12 ; Search time 29.8324 Seconds
(without alignments)
3026.121 Million cell updates/sec

Title: US-09-674-379A-13
Perfect score: 2533
Sequence: 1 MPGIKRILTVTILALCLPSP...INFRGSSVIRLRIYVSQYPF 448

Scoring table: BLOSUM62
Gapop 10.0 , Gapext 0.5

Searched: 747907 seqs, 201509753 residues

Total number of hits satisfying chosen parameters: 

Minimum DB seq length: 0 

Maximum DB seq length: 2000000000 

Post -processing: Minimum Match 0% 

Maximum Match 100% 
Listing first 45 summaries 
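
The throughput line in this header can be reproduced from the other header values, assuming a "cell update" means one query residue scored against one database residue. That is the usual meaning for Smith-Waterman benchmarks, but the report does not define the term, so treat the Python below as a sketch under that assumption; the function name is illustrative.

```python
def million_cell_updates_per_second(query_length: int,
                                    database_residues: int,
                                    search_seconds: float) -> float:
    """Dynamic-programming cells filled per second, in millions.

    Assumption: one cell per (query residue, database residue) pair.
    """
    cells = query_length * database_residues
    return cells / search_seconds / 1e6


if __name__ == "__main__":
    # Header values: 448-residue query, 201509753 database residues,
    # search time 29.8324 seconds.
    rate = million_cell_updates_per_second(448, 201509753, 29.8324)
    print(f"{rate:.1f} Million cell updates/sec")
```

With the header values the result agrees with the reported 3026.121 Million cell updates/sec to within the rounding of the printed search time.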



Database : Published_Applications_AA:*
1: /cgn2_6/ptodata/2/pubpaa/US07_PUBCOMB.pep:*
2: /cgn2_6/ptodata/2/pubpaa/PCT_NEW_PUB.pep:*
3: /cgn2_6/ptodata/2/pubpaa/US06_NEW_PUB.pep:*
4: /cgn2_6/ptodata/2/pubpaa/US06_PUBCOMB.pep:*
5: /cgn2_6/ptodata/2/pubpaa/US07_NEW_PUB.pep:*
6: /cgn2_6/ptodata/2/pubpaa/PCTUS_PUBCOMB.pep:*
7: /cgn2_6/ptodata/2/pubpaa/US08_NEW_PUB.pep:*
8: /cgn2_6/ptodata/2/pubpaa/US08_PUBCOMB.pep:*
9: /cgn2_6/ptodata/2/pubpaa/US09A_PUBCOMB.pep:*
10: /cgn2_6/ptodata/2/pubpaa/US09B_PUBCOMB.pep:*
11: /cgn2_6/ptodata/2/pubpaa/US09C_PUBCOMB.pep:*
12: /cgn2_6/ptodata/2/pubpaa/US09_NEW_PUB.pep:*
13: /cgn2_6/ptodata/2/pubpaa/US10A_PUBCOMB.pep:*
14: /cgn2_6/ptodata/2/pubpaa/US10B_PUBCOMB.pep:*
15: /cgn2_6/ptodata/2/pubpaa/US10C_PUBCOMB.pep:*
16: /cgn2_6/ptodata/2/pubpaa/US10_NEW_PUB.pep:*
17: /cgn2_6/ptodata/2/pubpaa/US60_NEW_PUB.pep:*
18: /cgn2_6/ptodata/2/pubpaa/US60_PUBCOMB.pep:*



Pred. No. is the number of results predicted by chance to have a 
score greater than or equal to the score of the result being printed, 
and is derived by analysis of the total score distribution. 
ALIGNMENTS



RESULT 1 
US-09-083-002-2 

; Sequence 2, Application US/09083002 

; Patent No. US20010016650A1 

; GENERAL INFORMATION: 

; APPLICANT: Jacobs, Kenneth 



APPLICANT: McCoy, John M. 
APPLICANT: Racie, Lisa A.
APPLICANT: LaVallie, Edward R. 
APPLICANT: Merberg, David 
APPLICANT: Treacy, Maurice 
APPLICANT: Evans, Cheryl 
APPLICANT: Agostino, Michael 
APPLICANT: Lu, Zhijian 
APPLICANT: Honjo, Tasuku 
APPLICANT: Tashiro, Kei 
APPLICANT: Nakamura, Tomoyuki 
TITLE OF INVENTION: SECRETED PROTEINS 
NUMBER OF SEQUENCES: 2 
CORRESPONDENCE ADDRESS : 

ADDRESSEE: Genetics Institute, Inc. 
STREET: 87 CambridgePark Drive 
CITY: Cambridge 
STATE: MA 
COUNTRY: U.S.A. 
ZIP: 02140
COMPUTER READABLE FORM: 

MEDIUM TYPE: Floppy disk 
COMPUTER: IBM PC compatible 
OPERATING SYSTEM: PC-DOS/MS-DOS
SOFTWARE: PatentIn Release #1.0, Version #1.30
CURRENT APPLICATION DATA:

APPLICATION NUMBER: US/09/083,002
FILING DATE:
CLASSIFICATION:
ATTORNEY/AGENT INFORMATION:
NAME: Sprunger, Suzanne A. 
REGISTRATION NUMBER: P-41,323 
TELECOMMUNICATION INFORMATION: 
TELEPHONE: (617) 498-8284 
TELEFAX: (617) 876-5851 
INFORMATION FOR SEQ ID NO: 2: 
SEQUENCE CHARACTERISTICS: 
LENGTH: 448 amino acids 
TYPE: amino acid 
STRANDEDNESS: not relevant
TOPOLOGY: linear 
MOLECULE TYPE: protein 
US-09-083-002-2 

Query Match 100.0%; Score 2533; DB 9; Length 448; 

Best Local Similarity 100.0%; Pred. No. 1.9e-192; 

Matches 448; Conservative 0; Mismatches 0; Indels 0; Gaps 0;

Qy   1 MPGIKRILTVTILALCLPSPGNAQAQCTNGFDLDRQSGQCLDIDECRTIPEACRGDMMCV 60
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db   1 MPGIKRILTVTILALCLPSPGNAQAQCTNGFDLDRQSGQCLDIDECRTIPEACRGDMMCV 60

Qy  61 NQNGGYLCIPRTNPVYRGPYSNPYSTPYSGPYPAAAPPLSAPNYPTISRPLICRFGYQMD 120
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  61 NQNGGYLCIPRTNPVYRGPYSNPYSTPYSGPYPAAAPPLSAPNYPTISRPLICRFGYQMD 120

Qy 121 ESNQCVDVDECATDSHQCNPTQICINTEGGYTCSCTDGYWLLEGQCLDIDECRYGYCQQL 180
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 121 ESNQCVDVDECATDSHQCNPTQICINTEGGYTCSCTDGYWLLEGQCLDIDECRYGYCQQL 180

Qy 181 CANVPGSYSCTCNPGFTLNEDGRSCQDVNECATENPCVQTCVNTYGSFICRCDPGYELEE 240
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 181 CANVPGSYSCTCNPGFTLNEDGRSCQDVNECATENPCVQTCVNTYGSFICRCDPGYELEE 240

Qy 241 DGVHCSDMDECSFSEFLCQHECVNQPGTYFCSCPPGYILLDDNRSCQDINECEHRNHTCN 300
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 241 DGVHCSDMDECSFSEFLCQHECVNQPGTYFCSCPPGYILLDDNRSCQDINECEHRNHTCN 300

Qy 301 LQQTCYNLQGGFKCIDPIRCEEPYLRISDNRCMCPAENPGCRDQPFTILYRDMDVVSGRS 360
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 301 LQQTCYNLQGGFKCIDPIRCEEPYLRISDNRCMCPAENPGCRDQPFTILYRDMDVVSGRS 360

Qy 361 VPADIFQMQATTRYPGAYYIFQIKSGNEGREFYMRQTGPISATLVMTRPIKGPREIQLDL 420
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 361 VPADIFQMQATTRYPGAYYIFQIKSGNEGREFYMRQTGPISATLVMTRPIKGPREIQLDL 420

Qy 421 EMITVNTVINFRGSSVIRLRIYVSQYPF 448
       ||||||||||||||||||||||||||||
Db 421 EMITVNTVINFRGSSVIRLRIYVSQYPF 448




RESULT 2 
US-09-275-805-2

Sequence 2, Application US/09275805 
Patent No. US20010051358A1 
GENERAL INFORMATION: 

APPLICANT: OLSEN, HENRIK S.
APPLICANT: LI, HAODONG

TITLE OF INVENTION: EXTRACELLULAR EPIDERMAL GROWTH FACTOR 
TITLE OF INVENTION: LIKE PROTEIN 
NUMBER OF SEQUENCES: 11 
CORRESPONDENCE ADDRESS : 

ADDRESSEE: HUMAN GENOME SCIENCES, INC. 

STREET: 9410 KEY WEST AVENUE 

CITY: ROCKVILLE 

STATE: MD 

COUNTRY: US

ZIP: 20850
COMPUTER READABLE FORM: 

MEDIUM TYPE: Floppy disk 

COMPUTER: IBM PC compatible 

OPERATING SYSTEM: PC-DOS/MS-DOS

SOFTWARE: PatentIn Release #1.0, Version #1.30
CURRENT APPLICATION DATA: 

APPLICATION NUMBER: US/09/275,805 

FILING DATE: 

CLASSIFICATION: 
PRIOR APPLICATION DATA: 

APPLICATION NUMBER: 08/839,525 

FILING DATE: 
ATTORNEY/AGENT INFORMATION:

NAME: BROOKES, ANDERS A.

REGISTRATION NUMBER: 36,373

REFERENCE/DOCKET NUMBER: PF224
TELECOMMUNICATION INFORMATION:
TELEPHONE: (301) 309-8509 
TELEFAX: (301) 309-8512 
INFORMATION FOR SEQ ID NO: 2: 
SEQUENCE CHARACTERISTICS: 
LENGTH: 448 amino acids 
TYPE: amino acid 
TOPOLOGY: linear 
MOLECULE TYPE: protein 
US-09-275-805-2 

Query Match 100.0%; Score 2533; DB 9; Length 448; 

Best Local Similarity 100.0%; Pred. No. 1.9e-192; 

Matches 448; Conservative 0; Mismatches 0; Indels 0; Gaps 0; 
Qy   1 MPGIKRILTVTILALCLPSPGNAQAQCTNGFDLDRQSGQCLDIDECRTIPEACRGDMMCV 60
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db   1 MPGIKRILTVTILALCLPSPGNAQAQCTNGFDLDRQSGQCLDIDECRTIPEACRGDMMCV 60

Qy  61 NQNGGYLCIPRTNPVYRGPYSNPYSTPYSGPYPAAAPPLSAPNYPTISRPLICRFGYQMD 120
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  61 NQNGGYLCIPRTNPVYRGPYSNPYSTPYSGPYPAAAPPLSAPNYPTISRPLICRFGYQMD 120

Qy 121 ESNQCVDVDECATDSHQCNPTQICINTEGGYTCSCTDGYWLLEGQCLDIDECRYGYCQQL 180
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 121 ESNQCVDVDECATDSHQCNPTQICINTEGGYTCSCTDGYWLLEGQCLDIDECRYGYCQQL 180

Qy 181 CANVPGSYSCTCNPGFTLNEDGRSCQDVNECATENPCVQTCVNTYGSFICRCDPGYELEE 240
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 181 CANVPGSYSCTCNPGFTLNEDGRSCQDVNECATENPCVQTCVNTYGSFICRCDPGYELEE 240

Qy 241 DGVHCSDMDECSFSEFLCQHECVNQPGTYFCSCPPGYILLDDNRSCQDINECEHRNHTCN 300
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 241 DGVHCSDMDECSFSEFLCQHECVNQPGTYFCSCPPGYILLDDNRSCQDINECEHRNHTCN 300

Qy 301 LQQTCYNLQGGFKCIDPIRCEEPYLRISDNRCMCPAENPGCRDQPFTILYRDMDVVSGRS 360
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 301 LQQTCYNLQGGFKCIDPIRCEEPYLRISDNRCMCPAENPGCRDQPFTILYRDMDVVSGRS 360

Qy 361 VPADIFQMQATTRYPGAYYIFQIKSGNEGREFYMRQTGPISATLVMTRPIKGPREIQLDL 420
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 361 VPADIFQMQATTRYPGAYYIFQIKSGNEGREFYMRQTGPISATLVMTRPIKGPREIQLDL 420

Qy 421 EMITVNTVINFRGSSVIRLRIYVSQYPF 448
       ||||||||||||||||||||||||||||
Db 421 EMITVNTVINFRGSSVIRLRIYVSQYPF 448



RESULT 3 
US-09-836-561-1 

Sequence 1, Application US/09836561 
Patent No. US20020038006A1
GENERAL INFORMATION: 

APPLICANT: Bandman, Olga
Corley, Neil C. 
Guegler, Karl J. 
TITLE OF INVENTION: HUMAN EXTRACELLULAR MATRIX PROTEINS 



; NUMBER OF SEQUENCES: 6 

; CORRESPONDENCE ADDRESS: 

; ADDRESSEE: Incyte Pharmaceuticals, Inc.

STREET: 3174 Porter Drive 

CITY: Palo Alto 

STATE: CA 

COUNTRY: USA 

ZIP : 94304 
COMPUTER READABLE FORM: 

MEDIUM TYPE: Diskette 
; COMPUTER: IBM Compatible 

OPERATING SYSTEM: DOS 

SOFTWARE: FastSEQ for Windows Version 2.0
; CURRENT APPLICATION DATA:

; APPLICATION NUMBER: US/09/836,561

; FILING DATE: 16-Apr-2001

CLASSIFICATION: <Unknown> 
PRIOR APPLICATION DATA: 

APPLICATION NUMBER: 09/212,168 

FILING DATE: <Unknown> 
ATTORNEY/AGENT INFORMATION:
; NAME: Billings, Lucy J.

REGISTRATION NUMBER: 36,749

REFERENCE/DOCKET NUMBER: PF-0333 US 
TELECOMMUNICATION INFORMATION : 

TELEPHONE: 415-855-0555 

TELEFAX: 415-845-4166 
; TELEX: <Unknown> 

; INFORMATION FOR SEQ ID NO: 1: 
; SEQUENCE CHARACTERISTICS:

; LENGTH: 448 amino acids 

TYPE: amino acid 

STRANDEDNESS : single 

TOPOLOGY : linear 
IMMEDIATE SOURCE: 

LIBRARY: CORNNOT01 

CLONE: 45517 
SEQUENCE DESCRIPTION: SEQ ID NO: 1: 
US-09-836-561-1 

Query Match 100.0%; Score 2533; DB 9; Length 448; 

Best Local Similarity 100.0%; Pred. No. 1.9e-192; 

Matches 448; Conservative 0; Mismatches 0; Indels 0; Gaps 0; 



Qy   1 MPGIKRILTVTILALCLPSPGNAQAQCTNGFDLDRQSGQCLDIDECRTIPEACRGDMMCV 60
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db   1 MPGIKRILTVTILALCLPSPGNAQAQCTNGFDLDRQSGQCLDIDECRTIPEACRGDMMCV 60

Qy  61 NQNGGYLCIPRTNPVYRGPYSNPYSTPYSGPYPAAAPPLSAPNYPTISRPLICRFGYQMD 120
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  61 NQNGGYLCIPRTNPVYRGPYSNPYSTPYSGPYPAAAPPLSAPNYPTISRPLICRFGYQMD 120

Qy 121 ESNQCVDVDECATDSHQCNPTQICINTEGGYTCSCTDGYWLLEGQCLDIDECRYGYCQQL 180
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 121 ESNQCVDVDECATDSHQCNPTQICINTEGGYTCSCTDGYWLLEGQCLDIDECRYGYCQQL 180

Qy 181 CANVPGSYSCTCNPGFTLNEDGRSCQDVNECATENPCVQTCVNTYGSFICRCDPGYELEE 240
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 181 CANVPGSYSCTCNPGFTLNEDGRSCQDVNECATENPCVQTCVNTYGSFICRCDPGYELEE 240

Qy 241 DGVHCSDMDECSFSEFLCQHECVNQPGTYFCSCPPGYILLDDNRSCQDINECEHRNHTCN 300
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 241 DGVHCSDMDECSFSEFLCQHECVNQPGTYFCSCPPGYILLDDNRSCQDINECEHRNHTCN 300

Qy 301 LQQTCYNLQGGFKCIDPIRCEEPYLRISDNRCMCPAENPGCRDQPFTILYRDMDVVSGRS 360
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 301 LQQTCYNLQGGFKCIDPIRCEEPYLRISDNRCMCPAENPGCRDQPFTILYRDMDVVSGRS 360

Qy 361 VPADIFQMQATTRYPGAYYIFQIKSGNEGREFYMRQTGPISATLVMTRPIKGPREIQLDL 420
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 361 VPADIFQMQATTRYPGAYYIFQIKSGNEGREFYMRQTGPISATLVMTRPIKGPREIQLDL 420

Qy 421 EMITVNTVINFRGSSVIRLRIYVSQYPF 448
       ||||||||||||||||||||||||||||
Db 421 EMITVNTVINFRGSSVIRLRIYVSQYPF 448



RESULT 4 
US-10-041-016-2 

Sequence 2, Application US/10041016 
Publication No. US20020165151A1 
GENERAL INFORMATION: 

APPLICANT: Jacobs, Kenneth 
McCoy, John M. 
Racie, Lisa A. 
LaVallie, Edward R. 
Merberg, David 
Treacy, Maurice 
Evans, Cheryl
Agostino, Michael 
Lu, Zhijian 
Honjo, Tasuku 
TITLE OF INVENTION: SECRETED PROTEINS 
NUMBER OF SEQUENCES: 2
CORRESPONDENCE ADDRESS:

ADDRESSEE: Genetics Institute, Inc.
STREET: 87 CambridgePark Drive
CITY: Cambridge
STATE: MA
COUNTRY: U.S.A.
ZIP: 02140
COMPUTER READABLE FORM:

MEDIUM TYPE: Floppy disk
COMPUTER: IBM PC compatible
OPERATING SYSTEM: PC-DOS/MS-DOS
SOFTWARE: PatentIn Release #1.0, Version #1.30
CURRENT APPLICATION DATA: 

APPLICATION NUMBER: US/10/041,016 
FILING DATE: 07-Jan-2002 
CLASSIFICATION: <Unknown> 
PRIOR APPLICATION DATA: 

APPLICATION NUMBER: US/09/083,002 
FILING DATE: 21-MAR-1998 
ATTORNEY/AGENT INFORMATION:

; NAME: Sprunger, Suzanne A.

REGISTRATION NUMBER: P-41,323
TELECOMMUNICATION INFORMATION:
TELEPHONE: (617) 498-8284
TELEFAX: (617) 876-5851
INFORMATION FOR SEQ ID NO: 2:
SEQUENCE CHARACTERISTICS:

LENGTH: 448 amino acids
TYPE: amino acid

STRANDEDNESS: Not Relevant
TOPOLOGY: linear 
MOLECULE TYPE: protein 
SEQUENCE DESCRIPTION: SEQ ID NO: 2: 
US-10-041-016-2 

Query Match 100.0%; Score 2533; DB 14; Length 448; 

Best Local Similarity 100.0%; Pred. No. 1.9e-192;

Matches 448; Conservative 0; Mismatches 0; Indels 0; Gaps 0; 
Qy   1 MPGIKRILTVTILALCLPSPGNAQAQCTNGFDLDRQSGQCLDIDECRTIPEACRGDMMCV 60
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db   1 MPGIKRILTVTILALCLPSPGNAQAQCTNGFDLDRQSGQCLDIDECRTIPEACRGDMMCV 60

Qy  61 NQNGGYLCIPRTNPVYRGPYSNPYSTPYSGPYPAAAPPLSAPNYPTISRPLICRFGYQMD 120
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  61 NQNGGYLCIPRTNPVYRGPYSNPYSTPYSGPYPAAAPPLSAPNYPTISRPLICRFGYQMD 120

Qy 121 ESNQCVDVDECATDSHQCNPTQICINTEGGYTCSCTDGYWLLEGQCLDIDECRYGYCQQL 180
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 121 ESNQCVDVDECATDSHQCNPTQICINTEGGYTCSCTDGYWLLEGQCLDIDECRYGYCQQL 180

Qy 181 CANVPGSYSCTCNPGFTLNEDGRSCQDVNECATENPCVQTCVNTYGSFICRCDPGYELEE 240
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 181 CANVPGSYSCTCNPGFTLNEDGRSCQDVNECATENPCVQTCVNTYGSFICRCDPGYELEE 240

Qy 241 DGVHCSDMDECSFSEFLCQHECVNQPGTYFCSCPPGYILLDDNRSCQDINECEHRNHTCN 300
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 241 DGVHCSDMDECSFSEFLCQHECVNQPGTYFCSCPPGYILLDDNRSCQDINECEHRNHTCN 300

Qy 301 LQQTCYNLQGGFKCIDPIRCEEPYLRISDNRCMCPAENPGCRDQPFTILYRDMDVVSGRS 360
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 301 LQQTCYNLQGGFKCIDPIRCEEPYLRISDNRCMCPAENPGCRDQPFTILYRDMDVVSGRS 360

Qy 361 VPADIFQMQATTRYPGAYYIFQIKSGNEGREFYMRQTGPISATLVMTRPIKGPREIQLDL 420
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 361 VPADIFQMQATTRYPGAYYIFQIKSGNEGREFYMRQTGPISATLVMTRPIKGPREIQLDL 420

Qy 421 EMITVNTVINFRGSSVIRLRIYVSQYPF 448
       ||||||||||||||||||||||||||||
Db 421 EMITVNTVINFRGSSVIRLRIYVSQYPF 448



RESULT 5 

US-10-199-672-408 

; Sequence 408, Application US/10199672 
; Publication No. US20030148442A1
; GENERAL INFORMATION:

APPLICANT: Baker, Kevin P.
APPLICANT: Chen,Jian
APPLICANT: Desnoyers, Luc
APPLICANT: Goddard, Audrey
APPLICANT: Godowski,Paul J.
APPLICANT: Gurney, Austin L.
APPLICANT: Pan, James
APPLICANT: Smith, Victoria
APPLICANT: Watanabe, Colin K.
APPLICANT: Wood, William I.
APPLICANT: Zhang, Zemin

TITLE OF INVENTION: SECRETED AND TRANSMEMBRANE POLYPEPTIDES AND NUCLEIC
TITLE OF INVENTION: ACIDS ENCODING THE SAME
FILE REFERENCE: P3430R1C1

CURRENT APPLICATION NUMBER: US/10/199,672 

CURRENT FILING DATE: 2002-07-18 

PRIOR APPLICATION NUMBER: US/10/052,586

PRIOR FILING DATE: 2002-01-15 

PRIOR APPLICATION NUMBER: 60/059263 

PRIOR FILING DATE: 1997-09-18 

PRIOR APPLICATION NUMBER: 60/059266

PRIOR FILING DATE: 1997-09-18

PRIOR APPLICATION NUMBER: 60/062250

PRIOR FILING DATE: 1997-10-17 

PRIOR APPLICATION NUMBER: 60/063120 

PRIOR FILING DATE: 1997-10-24 

PRIOR APPLICATION NUMBER: 60/063121 

PRIOR FILING DATE: 1997-10-24 

PRIOR APPLICATION NUMBER: 60/063486 

PRIOR FILING DATE : 1997-10-21 

PRIOR APPLICATION NUMBER: 60/063540

PRIOR FILING DATE: 1997-10-28 

PRIOR APPLICATION NUMBER: 60/063541 

PRIOR FILING DATE: 1997-10-28 

PRIOR APPLICATION NUMBER : 60/063544 

PRIOR FILING DATE: 1997-10-28 

Remaining Prior Application data removed - See File Wrapper or PALM. 
NUMBER OF SEQ ID NOS: 612 
SEQ ID NO 408

LENGTH: 448

TYPE: PRT 

ORGANISM: Homo Sapien 
US-10-199-672-408 



Query Match 99.8%; Score 2527; DB 12; Length 448;

Best Local Similarity 99.8%; Pred. No. 5.7e-192;

Matches 447; Conservative 0; Mismatches 1; Indels 0; Gaps 0;

Qy   1 MPGIKRILTVTILALCLPSPGNAQAQCTNGFDLDRQSGQCLDIDECRTIPEACRGDMMCV 60
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db   1 MPGIKRILTVTILALCLPSPGNAQAQCTNGFDLDRQSGQCLDIDECRTIPEACRGDMMCV 60

Qy  61 NQNGGYLCIPRTNPVYRGPYSNPYSTPYSGPYPAAAPPLSAPNYPTISRPLICRFGYQMD 120
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  61 NQNGGYLCIPRTNPVYRGPYSNPYSTPYSGPYPAAAPPLSAPNYPTISRPLICRFGYQMD 120

Qy 121 ESNQCVDVDECATDSHQCNPTQICINTEGGYTCSCTDGYWLLEGQCLDIDECRYGYCQQL 180
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 121 ESNQCVDVDECATDSHQCNPTQICINTEGGYTCSCTDGYWLLEGQCLDIDECRYGYCQQL 180

Qy 181 CANVPGSYSCTCNPGFTLNEDGRSCQDVNECATENPCVQTCVNTYGSFICRCDPGYELEE 240
       ||||||||||||||||||||||||||||||||||||||||||||||| ||||||||||||
Db 181 CANVPGSYSCTCNPGFTLNEDGRSCQDVNECATENPCVQTCVNTYGSLICRCDPGYELEE 240

Qy 241 DGVHCSDMDECSFSEFLCQHECVNQPGTYFCSCPPGYILLDDNRSCQDINECEHRNHTCN 300
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 241 DGVHCSDMDECSFSEFLCQHECVNQPGTYFCSCPPGYILLDDNRSCQDINECEHRNHTCN 300

Qy 301 LQQTCYNLQGGFKCIDPIRCEEPYLRISDNRCMCPAENPGCRDQPFTILYRDMDVVSGRS 360
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 301 LQQTCYNLQGGFKCIDPIRCEEPYLRISDNRCMCPAENPGCRDQPFTILYRDMDVVSGRS 360

Qy 361 VPADIFQMQATTRYPGAYYIFQIKSGNEGREFYMRQTGPISATLVMTRPIKGPREIQLDL 420
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 361 VPADIFQMQATTRYPGAYYIFQIKSGNEGREFYMRQTGPISATLVMTRPIKGPREIQLDL 420

Qy 421 EMITVNTVINFRGSSVIRLRIYVSQYPF 448
       ||||||||||||||||||||||||||||
Db 421 EMITVNTVINFRGSSVIRLRIYVSQYPF 448



RESULT 6 

US-10-187-749-408 

Sequence 408, Application US/10187749 
Publication No. US20030153036A1 
GENERAL INFORMATION: 

APPLICANT: Baker, Kevin P. 

APPLICANT: Chen,Jian 

APPLICANT: Desnoyers, Luc

APPLICANT: Goddard, Audrey

APPLICANT: Godowski, Paul J.

APPLICANT: Gurney, Austin L.

APPLICANT: Pan, James

APPLICANT: Smith, Victoria

APPLICANT: Watanabe, Colin K.

APPLICANT: Wood, William I.

APPLICANT: Zhang, Zemin

TITLE OF INVENTION: SECRETED AND TRANSMEMBRANE POLYPEPTIDES AND NUCLEIC 
TITLE OF INVENTION: ACIDS ENCODING THE SAME 
FILE REFERENCE: P3430R1C1 

CURRENT APPLICATION NUMBER: US/10/187,749

CURRENT FILING DATE: 2002-07-01 

PRIOR APPLICATION NUMBER: US/10/052,586 

PRIOR FILING DATE: 2002-01-15 

PRIOR APPLICATION NUMBER: 60/059263 

PRIOR FILING DATE: 1997-09-18 

PRIOR APPLICATION NUMBER: 60/059266 

PRIOR FILING DATE: 1997-09-18

PRIOR APPLICATION NUMBER: 60/062250 

PRIOR FILING DATE: 1997-10-17 

PRIOR APPLICATION NUMBER : 60/063120 

PRIOR FILING DATE: 1997-10-24 

PRIOR APPLICATION NUMBER: 60/063121 

PRIOR FILING DATE: 1997-10-24 



; PRIOR APPLICATION NUMBER: 60/063486 

; PRIOR FILING DATE: 1997-10-21 

; PRIOR APPLICATION NUMBER: 60/063540 

; PRIOR FILING DATE: 1997-10-28 

; PRIOR APPLICATION NUMBER: 60/063541 

; PRIOR FILING DATE: 1997-10-28 

; PRIOR APPLICATION NUMBER: 60/063544 

; PRIOR FILING DATE: 1997-10-28 

; Remaining Prior Application data removed - See File Wrapper or PALM. 

; NUMBER OF SEQ ID NOS: 612 

; SEQ ID NO 408 

; LENGTH: 448 

TYPE: PRT

; ORGANISM: Homo Sapien
US-10-187-749-408 



Query Match 99.8%; Score 2527; DB 12; Length 448; 

Best Local Similarity 99.8%; Pred. No. 5.7e-192; 

Matches 447; Conservative 0; Mismatches 1; Indels 0; Gaps 0; 



Qy   1 MPGIKRILTVTILALCLPSPGNAQAQCTNGFDLDRQSGQCLDIDECRTIPEACRGDMMCV 60
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db   1 MPGIKRILTVTILALCLPSPGNAQAQCTNGFDLDRQSGQCLDIDECRTIPEACRGDMMCV 60

Qy  61 NQNGGYLCIPRTNPVYRGPYSNPYSTPYSGPYPAAAPPLSAPNYPTISRPLICRFGYQMD 120
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  61 NQNGGYLCIPRTNPVYRGPYSNPYSTPYSGPYPAAAPPLSAPNYPTISRPLICRFGYQMD 120

Qy 121 ESNQCVDVDECATDSHQCNPTQICINTEGGYTCSCTDGYWLLEGQCLDIDECRYGYCQQL 180
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 121 ESNQCVDVDECATDSHQCNPTQICINTEGGYTCSCTDGYWLLEGQCLDIDECRYGYCQQL 180

Qy 181 CANVPGSYSCTCNPGFTLNEDGRSCQDVNECATENPCVQTCVNTYGSFICRCDPGYELEE 240
       ||||||||||||||||||||||||||||||||||||||||||||||| ||||||||||||
Db 181 CANVPGSYSCTCNPGFTLNEDGRSCQDVNECATENPCVQTCVNTYGSLICRCDPGYELEE 240

Qy 241 DGVHCSDMDECSFSEFLCQHECVNQPGTYFCSCPPGYILLDDNRSCQDINECEHRNHTCN 300
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 241 DGVHCSDMDECSFSEFLCQHECVNQPGTYFCSCPPGYILLDDNRSCQDINECEHRNHTCN 300

Qy 301 LQQTCYNLQGGFKCIDPIRCEEPYLRISDNRCMCPAENPGCRDQPFTILYRDMDVVSGRS 360
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 301 LQQTCYNLQGGFKCIDPIRCEEPYLRISDNRCMCPAENPGCRDQPFTILYRDMDVVSGRS 360

Qy 361 VPADIFQMQATTRYPGAYYIFQIKSGNEGREFYMRQTGPISATLVMTRPIKGPREIQLDL 420
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 361 VPADIFQMQATTRYPGAYYIFQIKSGNEGREFYMRQTGPISATLVMTRPIKGPREIQLDL 420

Qy 421 EMITVNTVINFRGSSVIRLRIYVSQYPF 448
       ||||||||||||||||||||||||||||
Db 421 EMITVNTVINFRGSSVIRLRIYVSQYPF 448





RESULT 7 

US-10-194-457-408 

; Sequence 408, Application US/10194457 
; Publication No. US20030153037A1



GENERAL INFORMATION: 
APPLICANT: Baker, Kevin P. 
APPLICANT: Chen,Jian 
APPLICANT: Desnoyers, Luc
APPLICANT: Goddard, Audrey
APPLICANT: Godowski, Paul J.
APPLICANT: Gurney, Austin L.
APPLICANT: Pan, James
APPLICANT: Smith, Victoria
APPLICANT: Watanabe, Colin K.
APPLICANT: Wood, William I. 
APPLICANT: Zhang, Zemin 

TITLE OF INVENTION: SECRETED AND TRANSMEMBRANE POLYPEPTIDES AND NUCLEIC 

TITLE OF INVENTION: ACIDS ENCODING THE SAME 

FILE REFERENCE: P3430R1C296

CURRENT APPLICATION NUMBER: US/10/194,457

CURRENT FILING DATE: 2002-07-11 

PRIOR APPLICATION NUMBER: 10/052586 

PRIOR FILING DATE: 2002-01-15 

PRIOR APPLICATION NUMBER: 60/059263

PRIOR FILING DATE: 1997-09-18 

PRIOR APPLICATION NUMBER: 60/059266 

PRIOR FILING DATE: 1997-09-18 

PRIOR APPLICATION NUMBER: 60/062250

PRIOR FILING DATE: 1997-10-17 

PRIOR APPLICATION NUMBER: 60/063120 

PRIOR FILING DATE: 1997-10-24 

PRIOR APPLICATION NUMBER: 60/063121 

PRIOR FILING DATE: 1997-10-24 

PRIOR APPLICATION NUMBER: 60/063486 

PRIOR FILING DATE: 1997-10-21 

PRIOR APPLICATION NUMBER: 60/063540

PRIOR FILING DATE: 1997-10-28

PRIOR APPLICATION NUMBER: 60/063541

PRIOR FILING DATE: 1997-10-28 

PRIOR APPLICATION NUMBER: 60/063544 

PRIOR FILING DATE: 1997-10-28 

Prior Application data removed - See File Wrapper or PALM. 
NUMBER OF SEQ ID NOS: 612
SEQ ID NO 408 

LENGTH: 448 

TYPE: PRT 

ORGANISM: Homo Sapien
US-10-194-457-408 



Query Match 99.8%; Score 2527; DB 12; Length 448;

Best Local Similarity 99.8%; Pred. No. 5.7e-192;

Matches 447; Conservative 0; Mismatches 1; Indels 0; Gaps 0;

Qy   1 MPGIKRILTVTILALCLPSPGNAQAQCTNGFDLDRQSGQCLDIDECRTIPEACRGDMMCV 60
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db   1 MPGIKRILTVTILALCLPSPGNAQAQCTNGFDLDRQSGQCLDIDECRTIPEACRGDMMCV 60

Qy  61 NQNGGYLCIPRTNPVYRGPYSNPYSTPYSGPYPAAAPPLSAPNYPTISRPLICRFGYQMD 120
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  61 NQNGGYLCIPRTNPVYRGPYSNPYSTPYSGPYPAAAPPLSAPNYPTISRPLICRFGYQMD 120

Qy 121 ESNQCVDVDECATDSHQCNPTQICINTEGGYTCSCTDGYWLLEGQCLDIDECRYGYCQQL 180
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 121 ESNQCVDVDECATDSHQCNPTQICINTEGGYTCSCTDGYWLLEGQCLDIDECRYGYCQQL 180

Qy 181 CANVPGSYSCTCNPGFTLNEDGRSCQDVNECATENPCVQTCVNTYGSFICRCDPGYELEE 240
       ||||||||||||||||||||||||||||||||||||||||||||||| ||||||||||||
Db 181 CANVPGSYSCTCNPGFTLNEDGRSCQDVNECATENPCVQTCVNTYGSLICRCDPGYELEE 240

Qy 241 DGVHCSDMDECSFSEFLCQHECVNQPGTYFCSCPPGYILLDDNRSCQDINECEHRNHTCN 300
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 241 DGVHCSDMDECSFSEFLCQHECVNQPGTYFCSCPPGYILLDDNRSCQDINECEHRNHTCN 300

Qy 301 LQQTCYNLQGGFKCIDPIRCEEPYLRISDNRCMCPAENPGCRDQPFTILYRDMDVVSGRS 360
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 301 LQQTCYNLQGGFKCIDPIRCEEPYLRISDNRCMCPAENPGCRDQPFTILYRDMDVVSGRS 360

Qy 361 VPADIFQMQATTRYPGAYYIFQIKSGNEGREFYMRQTGPISATLVMTRPIKGPREIQLDL 420
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 361 VPADIFQMQATTRYPGAYYIFQIKSGNEGREFYMRQTGPISATLVMTRPIKGPREIQLDL 420

Qy 421 EMITVNTVINFRGSSVIRLRIYVSQYPF 448
       ||||||||||||||||||||||||||||
Db 421 EMITVNTVINFRGSSVIRLRIYVSQYPF 448



RESULT 8 

US-10-184-642-408 

Sequence 408, Application US/10184642 
Publication No. US20030157635A1 
GENERAL INFORMATION: 

APPLICANT: Baker, Kevin P. 

APPLICANT: Chen,Jian 

APPLICANT: Desnoyers, Luc

APPLICANT: Goddard, Audrey

APPLICANT: Godowski, Paul J.

APPLICANT: Gurney, Austin L. 

APPLICANT: Pan, James 

APPLICANT: Smith, Victoria 

APPLICANT: Watanabe, Colin K. 

APPLICANT: Wood, William I.

APPLICANT: Zhang, Zemin 

TITLE OF INVENTION: SECRETED AND TRANSMEMBRANE POLYPEPTIDES AND NUCLEIC 

TITLE OF INVENTION: ACIDS ENCODING THE SAME 

FILE REFERENCE: P3430R1C194 

CURRENT APPLICATION NUMBER: US/10/184,642

CURRENT FILING DATE: 2002-06-27 

Prior Application removed - See File Wrapper or Palm 
NUMBER OF SEQ ID NOS: 612
SEQ ID NO 408
LENGTH: 448
TYPE: PRT 

ORGANISM: Homo Sapien 
US-10-184-642-408 



Query Match 99.8%; Score 2527; DB 12; Length 448;

Best Local Similarity 99.8%; Pred. No. 5.7e-192;

Matches 447; Conservative 0; Mismatches 1; Indels 0; Gaps 0;

Qy   1 MPGIKRILTVTILALCLPSPGNAQAQCTNGFDLDRQSGQCLDIDECRTIPEACRGDMMCV 60
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db   1 MPGIKRILTVTILALCLPSPGNAQAQCTNGFDLDRQSGQCLDIDECRTIPEACRGDMMCV 60

Qy  61 NQNGGYLCIPRTNPVYRGPYSNPYSTPYSGPYPAAAPPLSAPNYPTISRPLICRFGYQMD 120
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  61 NQNGGYLCIPRTNPVYRGPYSNPYSTPYSGPYPAAAPPLSAPNYPTISRPLICRFGYQMD 120

Qy 121 ESNQCVDVDECATDSHQCNPTQICINTEGGYTCSCTDGYWLLEGQCLDIDECRYGYCQQL 180
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 121 ESNQCVDVDECATDSHQCNPTQICINTEGGYTCSCTDGYWLLEGQCLDIDECRYGYCQQL 180

Qy 181 CANVPGSYSCTCNPGFTLNEDGRSCQDVNECATENPCVQTCVNTYGSFICRCDPGYELEE 240
       ||||||||||||||||||||||||||||||||||||||||||||||| ||||||||||||
Db 181 CANVPGSYSCTCNPGFTLNEDGRSCQDVNECATENPCVQTCVNTYGSLICRCDPGYELEE 240

Qy 241 DGVHCSDMDECSFSEFLCQHECVNQPGTYFCSCPPGYILLDDNRSCQDINECEHRNHTCN 300
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 241 DGVHCSDMDECSFSEFLCQHECVNQPGTYFCSCPPGYILLDDNRSCQDINECEHRNHTCN 300

Qy 301 LQQTCYNLQGGFKCIDPIRCEEPYLRISDNRCMCPAENPGCRDQPFTILYRDMDVVSGRS 360
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 301 LQQTCYNLQGGFKCIDPIRCEEPYLRISDNRCMCPAENPGCRDQPFTILYRDMDVVSGRS 360

Qy 361 VPADIFQMQATTRYPGAYYIFQIKSGNEGREFYMRQTGPISATLVMTRPIKGPREIQLDL 420
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 361 VPADIFQMQATTRYPGAYYIFQIKSGNEGREFYMRQTGPISATLVMTRPIKGPREIQLDL 420

Qy 421 EMITVNTVINFRGSSVIRLRIYVSQYPF 448
       ||||||||||||||||||||||||||||
Db 421 EMITVNTVINFRGSSVIRLRIYVSQYPF 448




RESULT 9 
US-10-196-747-408

Sequence 408, Application US/10196747 
Publication No. US20030162250A1 
GENERAL INFORMATION: 

APPLICANT: Baker, Kevin P. 

APPLICANT: Chen,Jian 

APPLICANT: Desnoyers, Luc

APPLICANT: Goddard, Audrey

APPLICANT: Godowski, Paul J.

APPLICANT: Gurney, Austin L. 

APPLICANT: Pan, James 

APPLICANT: Smith, Victoria 

APPLICANT: Watanabe, Colin K. 

APPLICANT: Wood, William I. 

APPLICANT: Zhang, Zemin 

TITLE OF INVENTION: SECRETED AND TRANSMEMBRANE POLYPEPTIDES AND NUCLEIC 

TITLE OF INVENTION: ACIDS ENCODING THE SAME 

FILE REFERENCE: P3430R1C346 

CURRENT APPLICATION NUMBER: US/10/196,747

CURRENT FILING DATE: 2002-07-16 

Prior Application removed - See File Wrapper or Palm 
NUMBER OF SEQ ID NOS: 612
SEQ ID NO 408
LENGTH: 448
TYPE: PRT

ORGANISM: Homo Sapien
US-10-196-747-408 

Query Match 99.8%; Score 2527; DB 12; Length 448; 

Best Local Similarity 99.8%; Pred. No. 5.7e-192; 

Matches 447; Conservative 0; Mismatches 1; Indels 0; Gaps 0; 
Qy   1 MPGIKRILTVTILALCLPSPGNAQAQCTNGFDLDRQSGQCLDIDECRTIPEACRGDMMCV 60
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db   1 MPGIKRILTVTILALCLPSPGNAQAQCTNGFDLDRQSGQCLDIDECRTIPEACRGDMMCV 60

Qy  61 NQNGGYLCIPRTNPVYRGPYSNPYSTPYSGPYPAAAPPLSAPNYPTISRPLICRFGYQMD 120
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  61 NQNGGYLCIPRTNPVYRGPYSNPYSTPYSGPYPAAAPPLSAPNYPTISRPLICRFGYQMD 120

Qy 121 ESNQCVDVDECATDSHQCNPTQICINTEGGYTCSCTDGYWLLEGQCLDIDECRYGYCQQL 180
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 121 ESNQCVDVDECATDSHQCNPTQICINTEGGYTCSCTDGYWLLEGQCLDIDECRYGYCQQL 180

Qy 181 CANVPGSYSCTCNPGFTLNEDGRSCQDVNECATENPCVQTCVNTYGSFICRCDPGYELEE 240
       ||||||||||||||||||||||||||||||||||||||||||||||| ||||||||||||
Db 181 CANVPGSYSCTCNPGFTLNEDGRSCQDVNECATENPCVQTCVNTYGSLICRCDPGYELEE 240

Qy 241 DGVHCSDMDECSFSEFLCQHECVNQPGTYFCSCPPGYILLDDNRSCQDINECEHRNHTCN 300
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 241 DGVHCSDMDECSFSEFLCQHECVNQPGTYFCSCPPGYILLDDNRSCQDINECEHRNHTCN 300

Qy 301 LQQTCYNLQGGFKCIDPIRCEEPYLRISDNRCMCPAENPGCRDQPFTILYRDMDVVSGRS 360
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 301 LQQTCYNLQGGFKCIDPIRCEEPYLRISDNRCMCPAENPGCRDQPFTILYRDMDVVSGRS 360

Qy 361 VPADIFQMQATTRYPGAYYIFQIKSGNEGREFYMRQTGPISATLVMTRPIKGPREIQLDL 420
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 361 VPADIFQMQATTRYPGAYYIFQIKSGNEGREFYMRQTGPISATLVMTRPIKGPREIQLDL 420

Qy 421 EMITVNTVINFRGSSVIRLRIYVSQYPF 448
       ||||||||||||||||||||||||||||
Db 421 EMITVNTVINFRGSSVIRLRIYVSQYPF 448
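
The percentages in the summary lines of each result are consistent with two simple ratios: Query Match as the result's score divided by the perfect score for the query (2533 in this search), and Best Local Similarity as the number of matches divided by the number of aligned positions. The sketch below works through that arithmetic under those assumptions. The function names are hypothetical, and the search program's exact rounding rule is not stated in this report.

```python
def query_match_percent(score, perfect_score):
    """Score as a percentage of the perfect (self-match) score, to one decimal."""
    return round(100.0 * score / perfect_score, 1)


def local_similarity_percent(matches, aligned_positions):
    """Identical positions as a percentage of aligned positions, to one decimal."""
    return round(100.0 * matches / aligned_positions, 1)


if __name__ == "__main__":
    # Values taken from the summary lines: score 2527 against a perfect score
    # of 2533, and 447 matches plus 1 mismatch over 448 aligned positions.
    print(query_match_percent(2527, 2533))
    print(local_similarity_percent(447, 448))
```

Both calls give 99.8, the value printed for the results with a single mismatch; a score of 2533 with 448 matches gives 100.0.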



RESULT 10 
US-10-173-689-408 

; Sequence 408, Application US/10173689 

; Publication No. US20030166104A1 

; GENERAL INFORMATION: 

; APPLICANT: Baker, Kevin P. 

; APPLICANT: Chen,Jian 

; APPLICANT: Desnoyers, Luc

; APPLICANT: Goddard, Audrey

; APPLICANT: Godowski, Paul J.

; APPLICANT: Gurney, Austin L.

; APPLICANT: Pan, James

; APPLICANT: Smith, Victoria

; APPLICANT: Watanabe, Colin K. 



APPLICANT: Wood, William I. 
APPLICANT: Zhang, Zemin 

TITLE OF INVENTION: SECRETED AND TRANSMEMBRANE POLYPEPTIDES AND NUCLEIC 

TITLE OF INVENTION: ACIDS ENCODING THE SAME 

FILE REFERENCE: P3430R1C10

CURRENT APPLICATION NUMBER: US/10/173,689

CURRENT FILING DATE: 2002-06-17 

Prior Application removed - See File Wrapper or Palm 
NUMBER OF SEQ ID NOS: 612 
SEQ ID NO 408
LENGTH: 448 
TYPE: PRT 

ORGANISM: Homo Sapien
US-10-173-689-408 

Query Match 99.8%; Score 2527; DB 12; Length 448; 

Best Local Similarity 99.8%; Pred. No. 5.7e-192;

Matches 447; Conservative 0; Mismatches 1; Indels 0; Gaps 0;

Qy   1 MPGIKRILTVTILALCLPSPGNAQAQCTNGFDLDRQSGQCLDIDECRTIPEACRGDMMCV 60
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db   1 MPGIKRILTVTILALCLPSPGNAQAQCTNGFDLDRQSGQCLDIDECRTIPEACRGDMMCV 60

Qy  61 NQNGGYLCIPRTNPVYRGPYSNPYSTPYSGPYPAAAPPLSAPNYPTISRPLICRFGYQMD 120
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  61 NQNGGYLCIPRTNPVYRGPYSNPYSTPYSGPYPAAAPPLSAPNYPTISRPLICRFGYQMD 120

Qy 121 ESNQCVDVDECATDSHQCNPTQICINTEGGYTCSCTDGYWLLEGQCLDIDECRYGYCQQL 180
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 121 ESNQCVDVDECATDSHQCNPTQICINTEGGYTCSCTDGYWLLEGQCLDIDECRYGYCQQL 180

Qy 181 CANVPGSYSCTCNPGFTLNEDGRSCQDVNECATENPCVQTCVNTYGSFICRCDPGYELEE 240
       ||||||||||||||||||||||||||||||||||||||||||||||| ||||||||||||
Db 181 CANVPGSYSCTCNPGFTLNEDGRSCQDVNECATENPCVQTCVNTYGSLICRCDPGYELEE 240

Qy 241 DGVHCSDMDECSFSEFLCQHECVNQPGTYFCSCPPGYILLDDNRSCQDINECEHRNHTCN 300
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 241 DGVHCSDMDECSFSEFLCQHECVNQPGTYFCSCPPGYILLDDNRSCQDINECEHRNHTCN 300

Qy 301 LQQTCYNLQGGFKCIDPIRCEEPYLRISDNRCMCPAENPGCRDQPFTILYRDMDVVSGRS 360
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 301 LQQTCYNLQGGFKCIDPIRCEEPYLRISDNRCMCPAENPGCRDQPFTILYRDMDVVSGRS 360

Qy 361 VPADIFQMQATTRYPGAYYIFQIKSGNEGREFYMRQTGPISATLVMTRPIKGPREIQLDL 420
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 361 VPADIFQMQATTRYPGAYYIFQIKSGNEGREFYMRQTGPISATLVMTRPIKGPREIQLDL 420

Qy 421 EMITVNTVINFRGSSVIRLRIYVSQYPF 448
       ||||||||||||||||||||||||||||
Db 421 EMITVNTVINFRGSSVIRLRIYVSQYPF 448



RESULT 11

US-10-173-690-408

Sequence 408, Application US/10173690
Publication No. US20030166105A1
GENERAL INFORMATION:

APPLICANT: Baker, Kevin P.
APPLICANT: Chen,Jian
APPLICANT: Desnoyers, Luc
APPLICANT: Goddard, Audrey
APPLICANT: Godowski, Paul J.
APPLICANT: Gurney, Austin L.
APPLICANT: Pan, James
APPLICANT: Smith, Victoria
APPLICANT: Watanabe, Colin K.
APPLICANT: Wood, William I.
APPLICANT: Zhang, Zemin

TITLE OF INVENTION: SECRETED AND TRANSMEMBRANE POLYPEPTIDES AND NUCLEIC
TITLE OF INVENTION: ACIDS ENCODING THE SAME 
FILE REFERENCE: P3430R1C9 

CURRENT APPLICATION NUMBER: US/10/173,690 
CURRENT FILING DATE: 2002-06-17 

Prior Application removed - See File Wrapper or Palm 
NUMBER OF SEQ ID NOS: 612 
SEQ ID NO 408 

LENGTH: 448

TYPE: PRT

ORGANISM: Homo Sapien
US-10-173-690-408 

Query Match 99.8%; Score 2527; DB 12; Length 448; 

Best Local Similarity 99.8%; Pred. No. 5.7e-192; 

Matches 447; Conservative 0; Mismatches 1; Indels 0; Gaps 0;

Qy   1 MPGIKRILTVTILALCLPSPGNAQAQCTNGFDLDRQSGQCLDIDECRTIPEACRGDMMCV 60
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db   1 MPGIKRILTVTILALCLPSPGNAQAQCTNGFDLDRQSGQCLDIDECRTIPEACRGDMMCV 60

Qy  61 NQNGGYLCIPRTNPVYRGPYSNPYSTPYSGPYPAAAPPLSAPNYPTISRPLICRFGYQMD 120
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  61 NQNGGYLCIPRTNPVYRGPYSNPYSTPYSGPYPAAAPPLSAPNYPTISRPLICRFGYQMD 120

Qy 121 ESNQCVDVDECATDSHQCNPTQICINTEGGYTCSCTDGYWLLEGQCLDIDECRYGYCQQL 180
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 121 ESNQCVDVDECATDSHQCNPTQICINTEGGYTCSCTDGYWLLEGQCLDIDECRYGYCQQL 180

Qy 181 CANVPGSYSCTCNPGFTLNEDGRSCQDVNECATENPCVQTCVNTYGSFICRCDPGYELEE 240
       ||||||||||||||||||||||||||||||||||||||||||||||| ||||||||||||
Db 181 CANVPGSYSCTCNPGFTLNEDGRSCQDVNECATENPCVQTCVNTYGSLICRCDPGYELEE 240

Qy 241 DGVHCSDMDECSFSEFLCQHECVNQPGTYFCSCPPGYILLDDNRSCQDINECEHRNHTCN 300
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 241 DGVHCSDMDECSFSEFLCQHECVNQPGTYFCSCPPGYILLDDNRSCQDINECEHRNHTCN 300

Qy 301 LQQTCYNLQGGFKCIDPIRCEEPYLRISDNRCMCPAENPGCRDQPFTILYRDMDVVSGRS 360
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 301 LQQTCYNLQGGFKCIDPIRCEEPYLRISDNRCMCPAENPGCRDQPFTILYRDMDVVSGRS 360

Qy 361 VPADIFQMQATTRYPGAYYIFQIKSGNEGREFYMRQTGPISATLVMTRPIKGPREIQLDL 420
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 361 VPADIFQMQATTRYPGAYYIFQIKSGNEGREFYMRQTGPISATLVMTRPIKGPREIQLDL 420

Qy 421 EMITVNTVINFRGSSVIRLRIYVSQYPF 448
       ||||||||||||||||||||||||||||
Db 421 EMITVNTVINFRGSSVIRLRIYVSQYPF 448



RESULT 12 

US-10-173-691-408 

Sequence 408, Application US/10173691 
Publication No. US20030166106A1
GENERAL INFORMATION: 



APPLICANT: Baker, Kevin P.
APPLICANT: Chen, Jian
APPLICANT: Desnoyers, Luc
APPLICANT: Goddard, Audrey
APPLICANT: Godowski, Paul J.
APPLICANT: Gurney, Austin L.
APPLICANT: Pan, James
APPLICANT: Smith, Victoria
APPLICANT: Watanabe, Colin K.
APPLICANT: Wood, William I.
APPLICANT: Zhang, Zemin

TITLE OF INVENTION: SECRETED AND TRANSMEMBRANE POLYPEPTIDES AND NUCLEIC
TITLE OF INVENTION: ACIDS ENCODING THE SAME
FILE REFERENCE: P3430R1C13

CURRENT APPLICATION NUMBER: US/10/173,691
CURRENT FILING DATE: 2002-06-17 

Prior Application removed - See File Wrapper or Palm 
NUMBER OF SEQ ID NOS: 612 
SEQ ID NO 408 
LENGTH: 448 
TYPE: PRT 

ORGANISM: Homo Sapien
US-10-173-691-408

Query Match 99.8%; Score 2527; DB 12; Length 448;

Best Local Similarity 99.8%; Pred. No. 5.7e-192;

Matches 447; Conservative 0; Mismatches 1; Indels 0; Gaps 0;

Qy   1 MPGIKRILTVTILALCLPSPGNAQAQCTNGFDLDRQSGQCLDIDECRTIPEACRGDMMCV 60
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db   1 MPGIKRILTVTILALCLPSPGNAQAQCTNGFDLDRQSGQCLDIDECRTIPEACRGDMMCV 60

Qy  61 NQNGGYLCIPRTNPVYRGPYSNPYSTPYSGPYPAAAPPLSAPNYPTISRPLICRFGYQMD 120
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  61 NQNGGYLCIPRTNPVYRGPYSNPYSTPYSGPYPAAAPPLSAPNYPTISRPLICRFGYQMD 120

Qy 121 ESNQCVDVDECATDSHQCNPTQICINTEGGYTCSCTDGYWLLEGQCLDIDECRYGYCQQL 180
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 121 ESNQCVDVDECATDSHQCNPTQICINTEGGYTCSCTDGYWLLEGQCLDIDECRYGYCQQL 180

Qy 181 CANVPGSYSCTCNPGFTLNEDGRSCQDVNECATENPCVQTCVNTYGSFICRCDPGYELEE 240
       ||||||||||||||||||||||||||||||||||||||||||||||| ||||||||||||
Db 181 CANVPGSYSCTCNPGFTLNEDGRSCQDVNECATENPCVQTCVNTYGSLICRCDPGYELEE 240

Qy 241 DGVHCSDMDECSFSEFLCQHECVNQPGTYFCSCPPGYILLDDNRSCQDINECEHRNHTCN 300
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 241 DGVHCSDMDECSFSEFLCQHECVNQPGTYFCSCPPGYILLDDNRSCQDINECEHRNHTCN 300

Qy 301 LQQTCYNLQGGFKCIDPIRCEEPYLRISDNRCMCPAENPGCRDQPFTILYRDMDVVSGRS 360
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 301 LQQTCYNLQGGFKCIDPIRCEEPYLRISDNRCMCPAENPGCRDQPFTILYRDMDVVSGRS 360

Qy 361 VPADIFQMQATTRYPGAYYIFQIKSGNEGREFYMRQTGPISATLVMTRPIKGPREIQLDL 420
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 361 VPADIFQMQATTRYPGAYYIFQIKSGNEGREFYMRQTGPISATLVMTRPIKGPREIQLDL 420

Qy 421 EMITVNTVINFRGSSVIRLRIYVSQYPF 448
       ||||||||||||||||||||||||||||
Db 421 EMITVNTVINFRGSSVIRLRIYVSQYPF 448



RESULT 13 
US-10-173-692-408 

Sequence 408, Application US/10173692 
Publication No. US20030166188A1 
GENERAL INFORMATION: 

APPLICANT: Baker, Kevin P.
APPLICANT: Chen, Jian
APPLICANT: Desnoyers, Luc
APPLICANT: Goddard, Audrey
APPLICANT: Godowski, Paul J.
APPLICANT: Gurney, Austin L.
APPLICANT: Pan, James
APPLICANT: Smith, Victoria
APPLICANT: Watanabe, Colin K.
APPLICANT: Wood, William I.
APPLICANT: Zhang, Zemin

TITLE OF INVENTION: SECRETED AND TRANSMEMBRANE POLYPEPTIDES AND NUCLEIC
TITLE OF INVENTION: ACIDS ENCODING THE SAME
FILE REFERENCE: P3430R1C20

CURRENT APPLICATION NUMBER: US/10/173,692
CURRENT FILING DATE: 2002-06-17

Prior Application removed - See File Wrapper or Palm 
NUMBER OF SEQ ID NOS: 612 
SEQ ID NO 408 
LENGTH: 448 
TYPE: PRT 

ORGANISM: Homo Sapien 
US-10-173-692-408 



Query Match 99.8%; Score 2527; DB 12; Length 448; 

Best Local Similarity 99.8%; Pred. No. 5.7e-192; 

Matches 447; Conservative 0; Mismatches 1; Indels 0; Gaps 0; 
Qy   1 MPGIKRILTVTILALCLPSPGNAQAQCTNGFDLDRQSGQCLDIDECRTIPEACRGDMMCV 60
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db   1 MPGIKRILTVTILALCLPSPGNAQAQCTNGFDLDRQSGQCLDIDECRTIPEACRGDMMCV 60

Qy  61 NQNGGYLCIPRTNPVYRGPYSNPYSTPYSGPYPAAAPPLSAPNYPTISRPLICRFGYQMD 120
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  61 NQNGGYLCIPRTNPVYRGPYSNPYSTPYSGPYPAAAPPLSAPNYPTISRPLICRFGYQMD 120

Qy 121 ESNQCVDVDECATDSHQCNPTQICINTEGGYTCSCTDGYWLLEGQCLDIDECRYGYCQQL 180
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 121 ESNQCVDVDECATDSHQCNPTQICINTEGGYTCSCTDGYWLLEGQCLDIDECRYGYCQQL 180

Qy 181 CANVPGSYSCTCNPGFTLNEDGRSCQDVNECATENPCVQTCVNTYGSFICRCDPGYELEE 240
       ||||||||||||||||||||||||||||||||||||||||||||||| ||||||||||||
Db 181 CANVPGSYSCTCNPGFTLNEDGRSCQDVNECATENPCVQTCVNTYGSLICRCDPGYELEE 240

Qy 241 DGVHCSDMDECSFSEFLCQHECVNQPGTYFCSCPPGYILLDDNRSCQDINECEHRNHTCN 300
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 241 DGVHCSDMDECSFSEFLCQHECVNQPGTYFCSCPPGYILLDDNRSCQDINECEHRNHTCN 300

Qy 301 LQQTCYNLQGGFKCIDPIRCEEPYLRISDNRCMCPAENPGCRDQPFTILYRDMDVVSGRS 360
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 301 LQQTCYNLQGGFKCIDPIRCEEPYLRISDNRCMCPAENPGCRDQPFTILYRDMDVVSGRS 360

Qy 361 VPADIFQMQATTRYPGAYYIFQIKSGNEGREFYMRQTGPISATLVMTRPIKGPREIQLDL 420
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 361 VPADIFQMQATTRYPGAYYIFQIKSGNEGREFYMRQTGPISATLVMTRPIKGPREIQLDL 420

Qy 421 EMITVNTVINFRGSSVIRLRIYVSQYPF 448
       ||||||||||||||||||||||||||||
Db 421 EMITVNTVINFRGSSVIRLRIYVSQYPF 448



RESULT 14 
US-10-173-694-408 

Sequence 408, Application US/10173694 
Publication No. US20030166107A1 
GENERAL INFORMATION: 

APPLICANT: Baker, Kevin P.
APPLICANT: Chen, Jian
APPLICANT: Desnoyers, Luc
APPLICANT: Goddard, Audrey
APPLICANT: Godowski, Paul J.
APPLICANT: Gurney, Austin L.
APPLICANT: Pan, James
APPLICANT: Smith, Victoria
APPLICANT: Watanabe, Colin K.
APPLICANT: Wood, William I.
APPLICANT: Zhang, Zemin

TITLE OF INVENTION: SECRETED AND TRANSMEMBRANE POLYPEPTIDES AND NUCLEIC
TITLE OF INVENTION: ACIDS ENCODING THE SAME
FILE REFERENCE: P3430R1C19

CURRENT APPLICATION NUMBER: US/10/173,694
CURRENT FILING DATE: 2002-06-17 

Prior Application removed - See File Wrapper or Palm 
NUMBER OF SEQ ID NOS: 612 
SEQ ID NO 408 
LENGTH: 448 
TYPE: PRT 

ORGANISM: Homo Sapien 
US-10-173-694-408 



Query Match 99.8%; Score 2527; DB 12; Length 448; 

Best Local Similarity 99.8%; Pred. No. 5.7e-192; 

Matches 447; Conservative 0; Mismatches 1; Indels 0; Gaps 0; 
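
The semicolon-separated statistics lines that precede each alignment (Query Match, Score, DB, Length, Best Local Similarity, Pred. No., Matches, and so on) are regular enough to be read by a short script. The following is a minimal sketch only: the function name `parse_stats` and the regular expression are illustrative assumptions, not part of the search program, and the sketch expects lines that have already been cleaned of OCR damage.

```python
import re

# Hypothetical helper (not part of the search software): pull "name value;"
# pairs out of one statistics line of the report.
FIELD = re.compile(r"([A-Za-z][A-Za-z. ]*?)\s+([0-9][0-9.eE+-]*)%?\s*;")

def parse_stats(line):
    """Return a dict mapping each field name on the line to a float."""
    return {name.strip(): float(value) for name, value in FIELD.findall(line)}

if __name__ == "__main__":
    print(parse_stats("Query Match 99.8%; Score 2527; DB 12; Length 448;"))
    print(parse_stats("Best Local Similarity 99.8%; Pred. No. 5.7e-192;"))
    print(parse_stats("Matches 447; Conservative 0; Mismatches 1; Indels 0; Gaps 0;"))
```

Values are returned as floats so that percentages, integer counts, and exponent-form Pred. No. values can share one code path.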



Qy   1 MPGIKRILTVTILALCLPSPGNAQAQCTNGFDLDRQSGQCLDIDECRTIPEACRGDMMCV 60
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db   1 MPGIKRILTVTILALCLPSPGNAQAQCTNGFDLDRQSGQCLDIDECRTIPEACRGDMMCV 60

Qy  61 NQNGGYLCIPRTNPVYRGPYSNPYSTPYSGPYPAAAPPLSAPNYPTISRPLICRFGYQMD 120
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  61 NQNGGYLCIPRTNPVYRGPYSNPYSTPYSGPYPAAAPPLSAPNYPTISRPLICRFGYQMD 120

Qy 121 ESNQCVDVDECATDSHQCNPTQICINTEGGYTCSCTDGYWLLEGQCLDIDECRYGYCQQL 180
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 121 ESNQCVDVDECATDSHQCNPTQICINTEGGYTCSCTDGYWLLEGQCLDIDECRYGYCQQL 180

Qy 181 CANVPGSYSCTCNPGFTLNEDGRSCQDVNECATENPCVQTCVNTYGSFICRCDPGYELEE 240
       ||||||||||||||||||||||||||||||||||||||||||||||| ||||||||||||
Db 181 CANVPGSYSCTCNPGFTLNEDGRSCQDVNECATENPCVQTCVNTYGSLICRCDPGYELEE 240

Qy 241 DGVHCSDMDECSFSEFLCQHECVNQPGTYFCSCPPGYILLDDNRSCQDINECEHRNHTCN 300
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 241 DGVHCSDMDECSFSEFLCQHECVNQPGTYFCSCPPGYILLDDNRSCQDINECEHRNHTCN 300

Qy 301 LQQTCYNLQGGFKCIDPIRCEEPYLRISDNRCMCPAENPGCRDQPFTILYRDMDVVSGRS 360
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 301 LQQTCYNLQGGFKCIDPIRCEEPYLRISDNRCMCPAENPGCRDQPFTILYRDMDVVSGRS 360

Qy 361 VPADIFQMQATTRYPGAYYIFQIKSGNEGREFYMRQTGPISATLVMTRPIKGPREIQLDL 420
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 361 VPADIFQMQATTRYPGAYYIFQIKSGNEGREFYMRQTGPISATLVMTRPIKGPREIQLDL 420

Qy 421 EMITVNTVINFRGSSVIRLRIYVSQYPF 448
       ||||||||||||||||||||||||||||
Db 421 EMITVNTVINFRGSSVIRLRIYVSQYPF 448




RESULT 15 
US-10-173-698-408





Sequence 408, Application US/10173698 
Publication No. US20030166108A1 

GENERAL INFORMATION: 
APPLICANT: Baker, Kevin P.
APPLICANT: Chen, Jian
APPLICANT: Desnoyers, Luc
APPLICANT: Goddard, Audrey
APPLICANT: Godowski, Paul J.
APPLICANT: Gurney, Austin L.
APPLICANT: Pan, James
APPLICANT: Smith, Victoria
APPLICANT: Watanabe, Colin K.
APPLICANT: Wood, William I.
APPLICANT: Zhang, Zemin

TITLE OF INVENTION: SECRETED AND TRANSMEMBRANE POLYPEPTIDES AND NUCLEIC 

TITLE OF INVENTION: ACIDS ENCODING THE SAME 

FILE REFERENCE: P3430R1C12 

CURRENT APPLICATION NUMBER: US/10/173,698 

CURRENT FILING DATE: 2002-06-17 

Prior Application removed - See File Wrapper or Palm 
NUMBER OF SEQ ID NOS: 612 
SEQ ID NO 408 
LENGTH: 448 



ORGANISM: Homo Sapien
US-10-173-698-408 



Query Match 99.8%; Score 2527; DB 12; Length 448; 

Best Local Similarity 99.8%; Pred. No. 5.7e-192; 

Matches 447; Conservative 0; Mismatches 1; Indels 0; Gaps 0; 



Qy   1 MPGIKRILTVTILALCLPSPGNAQAQCTNGFDLDRQSGQCLDIDECRTIPEACRGDMMCV 60
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db   1 MPGIKRILTVTILALCLPSPGNAQAQCTNGFDLDRQSGQCLDIDECRTIPEACRGDMMCV 60

Qy  61 NQNGGYLCIPRTNPVYRGPYSNPYSTPYSGPYPAAAPPLSAPNYPTISRPLICRFGYQMD 120
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  61 NQNGGYLCIPRTNPVYRGPYSNPYSTPYSGPYPAAAPPLSAPNYPTISRPLICRFGYQMD 120

Qy 121 ESNQCVDVDECATDSHQCNPTQICINTEGGYTCSCTDGYWLLEGQCLDIDECRYGYCQQL 180
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 121 ESNQCVDVDECATDSHQCNPTQICINTEGGYTCSCTDGYWLLEGQCLDIDECRYGYCQQL 180

Qy 181 CANVPGSYSCTCNPGFTLNEDGRSCQDVNECATENPCVQTCVNTYGSFICRCDPGYELEE 240
       ||||||||||||||||||||||||||||||||||||||||||||||| ||||||||||||
Db 181 CANVPGSYSCTCNPGFTLNEDGRSCQDVNECATENPCVQTCVNTYGSLICRCDPGYELEE 240

Qy 241 DGVHCSDMDECSFSEFLCQHECVNQPGTYFCSCPPGYILLDDNRSCQDINECEHRNHTCN 300
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 241 DGVHCSDMDECSFSEFLCQHECVNQPGTYFCSCPPGYILLDDNRSCQDINECEHRNHTCN 300

Qy 301 LQQTCYNLQGGFKCIDPIRCEEPYLRISDNRCMCPAENPGCRDQPFTILYRDMDVVSGRS 360
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 301 LQQTCYNLQGGFKCIDPIRCEEPYLRISDNRCMCPAENPGCRDQPFTILYRDMDVVSGRS 360

Qy 361 VPADIFQMQATTRYPGAYYIFQIKSGNEGREFYMRQTGPISATLVMTRPIKGPREIQLDL 420
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 361 VPADIFQMQATTRYPGAYYIFQIKSGNEGREFYMRQTGPISATLVMTRPIKGPREIQLDL 420

Qy 421 EMITVNTVINFRGSSVIRLRIYVSQYPF 448
       ||||||||||||||||||||||||||||
Db 421 EMITVNTVINFRGSSVIRLRIYVSQYPF 448





Search completed: January 9, 2004, 12:43:58 
Job time : 30.8324 secs



GenCore version 5.1.6
Copyright (c) 1993 - 2004 Compugen Ltd.

OM protein - protein search, using sw model

Run on:          January 9, 2004, 12:29:16 ; Search time 16.9736 Seconds
                 (without alignments)
                 2538.270 Million cell updates/sec

Title:           US-09-674-379A-13
Perfect score:   2533
Sequence:        1 MPGIKRILTVTILALCLPSP INFRGSSVIRLRIYVSQYPF 448

Scoring table:   BLOSUM62
                 Gapop 10.0 , Gapext 0.5

Searched:        283308 seqs, 96168682 residues

Total number of hits satisfying chosen parameters:    283308

Minimum DB seq length: 0
Maximum DB seq length: 2000000000

Post-processing: Minimum Match 0%
                 Maximum Match 100%
                 Listing first 45 summaries

Database:        PIR_76:*
                 pir1:*
                 pir2:*
                 pir3:*
                 pir4:*

Pred. No. is the number of results predicted by chance to have a 
score greater than or equal to the score of the result being printed, 
and is derived by analysis of the total score distribution. 
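
The Query Match percentages in the summaries below are numerically consistent with Score divided by the Perfect score from the search header (2533), rounded to one decimal place. The sketch below checks that relationship; it is an inference from the numbers in this report rather than documented program behaviour, and the function name `query_match_percent` is hypothetical.

```python
# Assumption (inferred from the report's own numbers, not from program
# documentation): Query Match = 100 * Score / Perfect score.

def query_match_percent(score, perfect_score):
    """Score expressed as a percentage of the query's perfect score."""
    return round(100.0 * score / perfect_score, 1)

PERFECT_SCORE = 2533  # "Perfect score" reported in the search header

if __name__ == "__main__":
    # (ID, Score) pairs copied from the summary listing
    for hit_id, score in [("JC5621", 1109.5), ("S78040", 746), ("C36346", 737)]:
        print(hit_id, query_match_percent(score, PERFECT_SCORE))
```

The same calculation gives the 99.8% reported for the patent-application hits that score 2527.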

SUMMARIES

Result           Query
   No.    Score  Match  Length  DB  ID      Description

     1   1109.5   43.8     493   2  JC5621  epidermal growth f
     2    963.5   38.0     387   2  I38449  extracellular prot
     3      746   29.5     685   2  S78040  fibulin, splice fo
     4      737   29.1     683   2  C36346  fibulin 1 precurso
     5    733.5   29.0    1221   2  A49457  fibulin-2 precurso
     6    709.5   28.0    1184   2  A55184  fibulin-2 precurso
     7    703.5   27.8     705   2  S34968  fibulin, splice fo
     8      619   24.4     601   2  B36346  fibulin 1 precurso
     9    606.5   23.9     689   2  T42760  fibulin, splice fo
    10    599.5   23.7     712   2  T42990  fibulin 1, splice
    11      596   23.5     589   2  T43210  fibulin-1D precurs
    12    577.5   22.8     798   2  T22793  hypothetical prote
    13      544   21.5    1394   2  A35626  transforming growt
    14    532.5   21.0    3002   2  A47221  fibrillin 1 precur
    15    531.5   21.0    2871   2  A55567  fibrillin 1 - bovi
    16      523   20.6    1712   2  A38261  masking protein pr
    17    518.5   20.5    2871   2  A55624  fibrillin-1 precur
    18    518.5   20.5    2907   2  A57278  fibrillin-2 precur
    19    517.5   20.4    2918   2  A54105  fibrillin-2 precur
    20    480.5   19.0    1820   2  A55494  latent transformin
    21      475   18.8     741   2  T46488  hypothetical prote
    22      461   18.2    1251   2  A57293  latent transformin
    23      459   18.1    1620   2  T27283  hypothetical prote
    24      444   17.5    1574   2  T13954  MEGF6 protein - ra
    25      392   15.5     886   2  A57172  probable hormone r
    26      388   15.3    3507   2  T34513  hypothetical prote
    27    372.5   14.7    1106   2  T18739  hypothetical prote
    28    371.5   14.7    2471   2  A49128  cell-fate determin
    29    365.5   14.4    1203   2  A49175  Motch B protein -
    30      364   14.4     810   2  T10756  Nel-homolog protei
    31      361   14.3    1081   2  T31329  receptor tyrosine
    32    359.5   14.2    1964   2  T09059  notch4 - mouse
    33      359   14.2     511   2  T17298  hypothetical prote
    34      358   14.1    2555   2  A40043  notch protein homo
    35      356   14.1    2703   1  A24420  notch protein - fr
    36    355.5   14.0    2437   2  S42612  transmembrane prot
    37      346   13.7    1064   2  A40136  fibropellin Ia - s
    38      346   13.7    2524   2  A35844  Xotch protein - Af
    39      344   13.6    2321   2  S78549  notch3 protein - h
    40    340.5   13.4     673   2  A48089  growth arrest-spec
    41    337.5   13.3    2318   2  S45306  notch 3 protein -
    42    337.5   13.3    2531   2  S18188  notch protein homo
    43      337   13.3    2531   2  T31070  notch homolog - se
    44    335.5   13.2     674   2  I55476  growth potentiatin
    45      335   13.2    2531   2  A46019  notch-1 protein -



ALIGNMENTS 



RESULT 1 
JC5621 

epidermal growth factor-like protein, T16 precursor - rat
C;Species: Rattus norvegicus (Norway rat)
C;Date: 09-Oct-1997 #sequence_revision 07-Nov-1997 #text_change 05-Nov-1999
C;Accession: JC5621
R;Ozaki, T.; Kondo, K.; Nakamura, Y.; Ichimiya, S.; Nakagawara, A.; Sakiyama, S.
Biochem. Biophys. Res. Commun. 237, 245-250, 1997
A;Title: Interaction of DA41, a DAN-binding protein, with the epidermal growth
factor-like protein, S(1-5).
A;Reference number: JC5621; MUID:97415782; PMID:9268694
A;Accession: JC5621
A;Molecule type: mRNA
A;Residues: 1-493 <OZA>
A;Cross-references: DDBJ:D89730; NID:g2429082; PIDN:BAA22265.1; PID:d1023127;
PID:g2429083
C;Comment: This protein plays a role in the regulation of cell growth by
interacting with DAN protein through DA41 protein.
C;Keywords: glycoprotein
F;1-17/Domain: signal sequence #status predicted <SIG>
F;28-70,158-199,200-237,238-277,278-318,319-359/Region: epidermal growth factor-like repeat
F;249/Binding site: carbohydrate (Asn) (covalent) #status predicted

Query Match 43.8%; Score 1109.5; DB 2; Length 493; 

Best Local Similarity 42.3%; Pred. No. 1.5e-65; 

Matches 206; Conservative 73; Mismatches 161; Indels 47; Gaps 5; 
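
The Best Local Similarity figure appears to be the number of identical positions divided by the total number of alignment columns (Matches + Conservative + Mismatches + Indels). For this result that gives 206 / 487, which rounds to the reported 42.3%. The sketch below is an inference from the reported numbers, not a statement of how the search program computes the value, and the function name is hypothetical.

```python
# Assumption (inferred from the report's numbers): Best Local Similarity is
# identities over all alignment columns, with indels counted as columns.

def best_local_similarity(matches, conservative, mismatches, indels):
    """Percentage of alignment columns that are exact matches."""
    columns = matches + conservative + mismatches + indels
    return round(100.0 * matches / columns, 1)

if __name__ == "__main__":
    # Counts copied from the statistics lines of three results
    print(best_local_similarity(206, 73, 161, 47))  # RESULT 1 (JC5621)
    print(best_local_similarity(161, 64, 153, 66))  # RESULT 3 (S78040)
    print(best_local_similarity(447, 0, 1, 0))      # patent-application hits
```

Note that the Gaps count (number of gap openings) does not enter the calculation; only the Indels count (gapped columns) does.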



Qy 7 ILTVTILALCLPSPGMAQAQCTNGFDLDRQSGQCLDIDECRTIPEACRGDMMCVNQNGGY 66 

Mh - : MhM I ii liltl MM1M I lit Ml 

Db 9 MLTLALVKSQ VTEET I TYTQCTDGYEWDPVRQQCKD I DECDI VPDACKGGMKCVNH YGGY 68 

Qy 67 LCI PRTNP VYRGPYSNPYSTPYS GPYPAAAPPLSAP 102 

IIMM : II : I MM :: I 

Db 69 LCLPKTAQI I VNNEQPQQETPAAEASSGAATGTI AARSMATSGVI PGGGFIASATAVAGP 128 

Qy 103 NYPT ISRPLI CRFGYQMDESWQCVDVDECATDSHQCNPTQ 142 

1 I : I Ih I I I hill : =1 | I 

Db 129 EVQTGRNNFVI RRNPADPQRI PSNPSHRI QCAAGYEQSEHNVCQDI DECTSGTHNCRLDQ 188 

Qy 143 ICINTEGGYTCSCTDGYWLLEGQCLDIDECRY-GYCQQLCANVPGSYSCTCNPGFTLNED 201 

Mil Mil I II I Mill I II I I I III-' I Hill I 

Db 189 VCIWLRGSPTCHCLPGYQKRGEQCVDIDECSVPPYCHQGCVNTPGSFYCQCNPGFQLAAN 248 

Qy 202 GRSCQDVNECATENPCVQTCVNTYGSFICRCDPGYELEEDGVHCSDMDECSFSEFLCQHE 261 

= 1 Mil I I I II llllhh Mil MM hill MUM 
Db 249 NYTCVDINECDASNQCAQQCYNILGSFICQCNQGYELSSDRLNCEDIDECRTSSYLCQYQ 308 

Qy 262 CVNQPGTYFCSCPPGYILLDDNRSCQDINECEHRNHTCNLQQTCYNLQGGFKCIDPIRCE 321 

llhil MUM MIIIMIil II MM Mhl h 

Db 309 CVNEPGKFSCMCPQGYQW-RSRTCQDINECETTNE-CREDEMCWNYHGGFRCYPQNPCQ 366 

Qy 322 EPYLRISDNRCMCPAENPGCRDQPFTILYRDMDWSGRSVPADIFQMQATTRYPGAYYIF 381 

Mh- I M II M I 1 III I MM* I • I 1 1 II M 1 1 h 1 1 II I I 

Db 367 DPYVLTSENRCVCPVSNTMCRDVPQSIVYKYMNIRSDRSVPSDIFQIQATTIYANTINTF 426 

Qy 382 Q I KSGNEGREF YMRQTG P I SATLVMTRP I KG PRE IQLDLEM I TVNTVI NFRGS SVI RLRI 441 

'•llllll MIMII hll II:": • Mil • I II M I • - II IIIMI I 
Db 427 RIKSGNENGEFYLRQTSPVSAMLVLVKSLTGPREHIVGLEMLTVSSIGTFRTSSVLRLTI 486 

Qy 442 YVSQYPF 448 

I : I 

Db 487 IVGPFSF 493 



RESULT 2 
I38449

extracellular protein - human
C;Species: Homo sapiens (man)
C;Date: 29-May-1998 #sequence_revision 29-May-1998 #text_change 21-Jul-2000
C;Accession: I38449
R;Lecka-Czernik, B.; Lumpkin, C.K.
Mol. Cell. Biol. 15, 120-128, 1995
A;Title: An overexpressed gene transcript in senescent and quiescent human
fibroblasts encoding a novel protein in the epidermal growth factor-like repeat
family stimulates DNA synthesis.
A;Reference number: I38449; MUID:95097983; PMID:7799918
A;Accession: I38449
A;Status: preliminary; translated from GB/EMBL/DDBJ
A;Molecule type: mRNA
A;Residues: 1-387 <RES>
A;Cross-references: EMBL:U03877; NID:g458227; PIDN:AAA65590.1; PID:g458228
C;Genetics:
A;Gene: S1-5

Query Match 38.0%; Score 963.5; DB 2; Length 387;

Best Local Similarity 48.3%; Pred. No. 4.4e-56;

Matches 171; Conservative 55; Mismatches 123; Indels 5; Gaps 4; 

Qy 96 APPLSAPNVPTISRPLICRFGYQMDESNQCVDVDECATDSHQCNPTQICINTEGGYTCSC 155 

II I- I I M II: I I I Mil --II I ■•III MM 

Db 38 ADPQRIPSNP--SHRIQCAAGYEQSEHNVCQDIDECTAGTHWCRADQVCINLRGSFACQC 95 

Qy 156 TDGYWLLEGQCLDIDECRY-GYCQQLCANVPGSYSCTCNPGFTLNEDGRSCQDVNECATE 214 

II I MM! II M I MM Mil I :| MM 

Db 96 P PGYQKRGEQCVD I DE CT I P P YCHQRCVWTPGS FYCQCSPGFQLAANN YTCVDI NECDAS 155 

Qy 215 HPCVQTCVNT YGS F I CRCDPGYELEEDGVHCSDMDECS FS EFLCQHECVNQPGTYFCS CP 274 

MM! IMMMMH! MM hill MMMMIIMM I M 

Db 156 MQCAQQCYNILGSFICQCNQGYELSSDRLNCEDIDECRTSSYLCQYQCVNEPGKFSCMCP 215 

Qy 275 PGY I LLDDNRSCQDINECEHRNHTCNLQQTCYNLQGGFKCIDPI RCEEPYLRI SDNRCMC 334 

II - MMIIIIill II MM IIIM i = MM MUM 

Db 216 QGYQW-RSRTCQDINECETTNE-CREDEMCHNYHGGFRCYPRNPCQDPYILTPENRCVC 273 

Qy 33 5 P AENPGCRDQP FT I LYRDMDWSGRS VPAD I FQMQATTRY PGAYY I FQ I KSGNEGREFYM 394 

I I MM MM= | : [ UIIMIIIMIII I IMIIIII llh 

Db 274 PVSNAMCRELPQSIVYKYMSIRSDRSVPSDIFQIQATTIYANTINTFRIKSGNENGEFYL 333 

Qy 395 RQTGPISATLVMTRPI KGPREI QLDLEMI TVNTVINFRGSSVIRLRI YVSQYPF 448 

III IMI Ih = = Ml MIIIMI-: II 111 = 11 MM 

Db 334 RQTSPVSAMLVLVKSLSGPREHIVDLEMLTVSSIGTFRTSSVLRLTIIVGPFSF 387 



RESULT 3 
S78040 

fibulin, splice form C precursor - mouse
N;Alternate names: basement-membrane protein BM-90
C;Species: Mus musculus (house mouse)
C;Date: 24-Jul-1998 #sequence_revision 24-Jul-1998 #text_change 02-Aug-2002
C;Accession: S78040; S78560; S36440
R;Pan, T.C.; Kluge, M.; Zhang, R.Z.; Mayer, U.; Timpl, R.; Chu, M.L.
Eur. J. Biochem. 215, 733-740, 1993
A;Title: Sequence of extracellular mouse protein BM-90/fibulin and its
calcium-dependent binding to other basement-membrane ligands.
A;Reference number: S34968; MUID:93358897; PMID:8354280
A;Accession: S78040
A;Molecule type: mRNA
A;Residues: 1-685 <PAN>
A;Cross-references: EMBL:X70854
R;Pan, T.C.; Kluge, M.; Zhang, R.Z.; Mayer, U.; Timpl, R.; Chu, M.L.
submitted to the EMBL Data Library, January 1993
A;Description: Sequence of extracellular mouse protein BM-90/fibulin and its
calcium-dependent binding to other basement membrane ligands.
A;Reference number: S36440
A;Accession: S78560
A;Molecule type: mRNA
A;Residues: 1-39,'P',41-685 <CHU>
A;Cross-references: EMBL:X70854
C;Genetics:
A;Introns: 568/3
C;Superfamily: fibulin-1; EGF homology
C;Keywords: alternative splicing; basement membrane; calcium binding;
extracellular matrix; glycoprotein; plasma
F;1-29/Domain: signal sequence #status predicted <SIG>
F;30-685/Product: fibulin, splice form C #status predicted <MAT>
F;98,537,541/Binding site: carbohydrate (Asn) (covalent) #status predicted

Query Match 29.5%; Score 746; DB 2; Length 685;

Best Local Similarity 36.3%; Pred. No. 1.3e-41;

Matches 161; Conservative 64; Mismatches 153; Indels 66; Gaps 17; 

Qy 16 CLPS PGNAQAQCTNGFDLDRQSGQCLDI DECRTI PEACRGDMMCVNQNGGYLCI PRTNPV 75 

II MMII I I 1=1 1=11 :| I Mill 
Db 290 CRP KLQCKSGFIQD-ALGWCIDINECLSISAPCPVGQTCINTEGSYTC 336 

Qy 76 YRGPYSNPYSTPYSGPYPAAAPPLSAPNYPTISRPLICRFGYQM-DESNQCVT)VDECATD 134 

Ml I MM MMMI 

Db 337 QKNVPN CGRGYHLNEEGTRCVDVDECAPP 365 

Qy 135 SHQCNPTQICINTEGGYTCSCTDGYWL- -LEGQCLDIDEC-RY- -GYCQQLCANVPGSYS 189 

i I M I I'-: IMM II I M III: 

Db 366 AEPCGKGHHCLNSPGSFRCECKAGPYFDGISRTCVDINECQRYPGRLCGHKCENTPGSFH 425 

Qy 190 CTCNPGFTLNEDGRSCQDVNECATEWPC VQTCVNTYGSF ICRCDPGYELEE - DGVHCSDM 248 

MMMMIIIMIIII HI I i I I MM I Ml Ml! 11 = 

Db 426 CSCSAGFRLSVDGRSCEDVNEC-LNSPCSQECANVYGSYQCYCRRGYQLSDVDGVTCEDI 484 

Qy 249 DECS F - - S EFLCQHECVNQ PGT YFCS CP - PG YI LLDDNRS CQDI NECEHRNHTCNLQQTC 305 

Ilh '-I : hi ||:= UN II I : hlllhll I I ■ I I 

Db 485 DECALPTGGHICSYRCINIPGSFQCSCPSSGYRLAPNGRMCQDIDECVTGIHNCSINETC 544 

Qy 306 YNLQGGFKCI DPI RCEEPYLRI SDNRCM CPAENPGCRDQPFTILYRDMDVVSGRSVP 362 

Mil l = |: M I Mill I II i MM = II 

Db 545 FNIQGSFRCLS-FECPENYRRSADTRCARLPC-HENQECPRLPLRITYYHLSFPTNIQVP 602 

Qy 363 AD I FQMQATTRYPGAY Y I FQ I KSGNEGREF YMRQTGP I SATLVMTRP I KG PRE I QLDLEM 422 

I M! II I ".III I I : I = I I I I I : : I • I 

Db 603 AWFRMGPSSAVPGDSMQLAITAGNEEGFFTTRKVSHHSGWALTKPIPEPRDLLLTVKM 662 

Qy 423 - - I T VNT VI NFRGS S VI RLR I YVS 444 

II =1 I =1 Ml 

Db 663 DLYRHGTVSSF VAKLFIFVS 682 



RESULT 4 
C36346 

fibulin 1 precursor, splice form C - human
N;Alternate names: fibulin C
N;Contains: fibulin 1 splice form A; fibulin 1 splice form C
C;Species: Homo sapiens (man)
C;Date: 19-Apr-1991 #sequence_revision 19-Apr-1991 #text_change 02-Aug-2002
C;Accession: C36346; A36346; A32826
R;Argraves, W.S.; Tran, H.; Burgess, W.H.; Dickerson, K.
J. Cell Biol. 111, 3155-3164, 1990
A;Title: Fibulin is an extracellular matrix and plasma glycoprotein with
repeated domain structure.
A;Reference number: A36346; MUID:91100426; PMID:2269669
A;Accession: C36346
A;Molecule type: mRNA
A;Residues: 1-683 <ARG>
A;Cross-references: GB:X53743; NID:g31413; PIDN:CAA37772.1; PID:g31419
A;Accession: A36346
A;Molecule type: mRNA
A;Residues: 1-566 <AR2>
A;Cross-references: GB:X53741; NID:g31414; PIDN:CAA37770.1; PID:g31415
R;Argraves, W.S.; Dickerson, K.; Burgess, W.H.; Ruoslahti, E.
Cell 58, 623-629, 1989
A;Title: Fibulin, a novel protein that interacts with the fibronectin receptor
beta-subunit cytoplasmic domain.
A;Reference number: A32826; MUID:89354537; PMID:2527614
A;Accession: A32826
A;Molecule type: protein
A;Residues: 30-35,'SX',38-40,'SH',43-44 <AR3>
C;Genetics:
A;Gene: GDB:FBLN1; FBLN
A;Cross-references: GDB:278285; OMIM:135820
A;Map position: 22q13.3-22q13.3
C;Superfamily: fibulin-1; EGF homology
C;Keywords: alternative splicing; glycoprotein
F;1-29/Domain: signal sequence #status predicted <SIG>
F;30-683/Product: fibulin 1 splice form C #status predicted <MAT>
F;180-214/Domain: EGF homology <EGF>
F;485-523/Domain: EGF homology <EGF1>
F;98,535,539/Binding site: carbohydrate (Asn) (covalent) #status predicted

Query Match 29.1%; Score 737; DB 2; Length 683;

Best Local Similarity 35.4%; Pred. No. 4.9e-41;

Matches 157; Conservative 67; Mismatches 154; Indels 66; Gaps 17;

Qy 16 CLPSPGNAQAQCTNGFDLDRQSGQCLDI DECRTI PEACRGDMMCVNQNGGYLCI PRTNPV 75 

II : II :|l I I 1 = 11=11 =1 I hi III 
Db 288 CRP KLQCKSG F I QD - ALGNCI D INECLS I SAPCP IGHTC I NTEGS YTC 334 

Qy 76 YRGPYSNPYSTPYSGPYPAAAPPLSAPNYPTISRPLICRFGYQM-DESNQCVDVDECATD 134 

=11 I 11=0 =11111111 

Db 335 QKNVPW CGRGYHLNEEGTRCVDVDE CAPP 363 

Qy. 135 SHQCNPTQICINTEGGYTCSCTDGYWL- -LEGQCLDIDEC-RY- -GYCQQLCANVPGSYS 189 

=1 1=1= I = I I ||: : hl-llll I I I III 

Db 364 AEPCGKGHRCVNSPGSFRCECKTGYYFDGISRMCVDVNECQRYPGRLCGHKCENTLGSYL 423 

Qy 190 CTCNPGFTLNEDGRSCQDVWECATENPCVQTCVNTYGSFICRCDPGYELEE-DGVHCSDM 248 

1=1= II 1= lllihMII- =11 I I I |||: I I IN : III I I: 
Db 424 CSCSVGFRLSVDGRSCEDINECSS-SPCSQECANVYGSYQCYCRRGYQLSDVDGVTCEDI 482 

Qy 249 DECSF--SEFLCQHECVNQPGTYFCSCP-PGYILLDDNRSCQDINECEHRNHTCNLQQTC 305 

111= =i = hi li== Mil II 1 = hlllhll I I- =11 



Db 483 DECALPTGGHICSYRCIMIPGSPQCSCPSSGYRIAPNGRNCQDIDECVTGIHNCSINETG 542 

Qy 306 YNLQGGPKCIDPI RCEEPYLR I SDNRC -MCPA- ENPGCRDQPFTILYRDMDWSGRSVPA 363 

:hll !--h I I I h II I II I MM = II 
Db 543 FNIQGAFRCL-AFECPEWYRRSAATRCERLPCHENRECSKLPLRITYYHLSFPTNIQAPA 601 

Qy 364 DI FQMQATTRYPGAYYI FQI KSGNEGREFYMRQTGPI SATLVMTRP I KGPREI QLDLEMI 423 

= |:| - II I III I h i I : =hl: ll = : 
Db 602 WFRMGPSSAVPGDSMQLAITGGNEEGFFTTRKVSPHSGWALTKPVPEPRDL LL 656 

Qy 424 TVNTVINFRG SSVIRLRIYVS 444 

II ^ I M :| Ml 
Db 657 T VKMDLSRHGT VS SF VAKLF I FVS 68 0 



RESULT 5 
A49457 

fibulin-2 precursor - mouse
C;Species: Mus musculus (house mouse)
C;Date: 03-May-1994 #sequence_revision 03-May-1994 #text_change 08-Sep-2002
C;Accession: A49457; S74095
R;Pan, T.C.; Sasaki, T.; Zhang, R.Z.; Faessler, R.; Timpl, R.; Chu, M.L.
J. Cell Biol. 123, 1269-1277, 1993
A;Title: Structure and expression of fibulin-2, a novel extracellular matrix
protein with multiple EGF-like repeats and consensus motifs for calcium binding.
A;Reference number: A49457; MUID:94064787; PMID:8245130
A;Accession: A49457
A;Status: preliminary
A;Molecule type: mRNA
A;Residues: 1-1221 <PAN>
A;Cross-references: GB:X75285; NID:g437046; PIDN:CAA53040.1; PID:g437047
R;Sasaki, T.; Mann, K.; Murphy, G.; Chu, M.L.; Timpl, R.
Eur. J. Biochem. 240, 427-434, 1996
A;Title: Different susceptibilities of fibulin-1 and fibulin-2 to cleavage by
matrix metalloproteinases and other tissue proteases.
A;Reference number: S74094; MUID:96439073; PMID:8841408
A;Accession: S74095
A;Molecule type: protein
A;Residues: 236-238,'X',240-247;260-275;336-344,'L',346-361;405-426;
566-568,'EM',569-589;653-666;784-787,'X',789-794;841-844,'X',846-850;
883-892,'X',894-894;930-935,'X',937-939 <SAS>
C;Superfamily: fibulin-2; EGF homology; von Willebrand factor type C repeat
homology
C;Keywords: calcium binding; duplication; extracellular matrix; glycoprotein;
homotrimer
F;942-978/Domain: EGF homology <EGF>

Query Match 29.0%; Score 733.5; DB 2; Length 1221;

Best Local Similarity 35.3%; Pred. No. 1.4e-40;

Matches 146; Conservative 60; Mismatches 149; Indels 59; Gaps 11;

Qy 23 AQAQCTNGFDLDRQSGQCLDIDECRTI PEACRGDMMCVNQNGGYLCI PRTNPVYRGPYSN 82 

h 0 :|| I i |:|hli 1 II I : I III 
Db 829 ARQRCMDGF-LQDPEGNCVDINECTSLLEPCRSGFSCINTVGSYTC 873 



Qy 



83 PYSTPYSGPYPAAAPPLSAPNYPTISRPLICRFGYQM-DESNQCVDVDECATDSHQCNPT 141 

II 'I •■•MINI I hi 



Db 


874 


QRNPLVCX3RGYHANEEGSECVDVNECETGVHRCGEG 


909 


Qy 


142 


QIC! NTEGGYTCS CTDGYW - - LLEGQCLD I DECRYG YCQQLCANVPGS YS CTCNPGF 


196 


Db 


910 


hi 1 1 1 1 1 1= 1 = 1-1! II 1 1 llll |:| |] 
QLCYNLPGSYRCDCKPGFQRDAFGRTCIDVNECWVSPGRLCQHTCENTPGSYRCSCAAGF 


969 


Qy 


197 


TLNEDGRSCQDVNECATENPCVQTCVNTYGSFI CRCDPGYEI/EEDGVHCSDMDECS - FSE 

1 Ih hlllll 1 llll ill: 1 1 I' III hhllh : 
LLAADGKHCEDVNECETRR - CSQECANI YGSYQCYCRQGYQLAEDGHTCTDI DECAQGAG 


255 


Db 


970 


1028 


Qy 


256 


FLCQHE CVNQPGTYFCS CP - PGY I LLDDNRSCQD I NECEHRNHTCNLQQTCYNLQGGFKC 


314 


Db 


1029 


11 III 11 = 1 hll II - = 111 = 1-11 1 h :||:|:|| hi 
I LCTFRCVNVPGS YQCACPEQGYTMMANGRS CKDLDECALGTHNCS EAETCHN I QGS FRC 


1088 


Qy 


315 


TnDrDfPtTDVT DTcnMDPiwr'DaPMDrr'Dn _ nopTTr VDniwnTnTOr , DCtrDRnTE , nM 
J.Dr'lKl-Eilih'l JjKlbDJNKUML.FA&iNFtjL.KlJ 1 liJlKUMJUVVbtjKoVt'AJJlt'yW 

1 .hhl =1 1 hi 1 I = - H III Ih: 
L-RFDCPPNYVRVSQTKC ERTTCQDITECQTSPARITHYQLNFQTGLLVPAHIFRI 


368 


Db 


1089 


1143 


Qy 


369 


QATTRYPGAYY I FQI KSGNEGREFYMRQTGPI SATLVMTRP I KGPRE I QLDLEM 422 




Db 


1144 


= 1 1 III 1 h : : • ! : 1 h 1 |: I 1 
GPAPAFAGDT I SLTITKGNEEGYFVTRRLNAYTGWSLQRSVLEPRDFALDVEM 1197 





RESULT 6 
A55184 

fibulin-2 precursor - human
N;Alternate names: protein DKFZp586A1519.1
C;Species: Homo sapiens (man)
C;Date: 27-Jan-1995 #sequence_revision 27-Jan-1995 #text_change 08-Sep-2002
C;Accession: A55184; T08744
R;Zhang, R.Z.; Pan, T.C.; Zhang, Z.Y.; Mattei, M.G.; Timpl, R.; Chu, M.L.
Genomics 22, 425-430, 1994
A;Title: Fibulin-2 (FBLN2): human cDNA sequence, mRNA expression, and mapping of
the gene on human and mouse chromosomes.
A;Reference number: A55184; MUID:95104855; PMID:7806230
A;Accession: A55184
A;Status: preliminary
A;Molecule type: mRNA
A;Residues: 1-1184 <ZHA>
A;Cross-references: GB:X82494; NID:g575232; PIDN:CAA57876.1; PID:g575233
R;Wambutt, R.; Heubner, D.; Mewes, H.W.; Gassenhuber, J.; Wiemann, S.
submitted to the Protein Sequence Database, March 1999
A;Reference number: Z16471
A;Accession: T08744
A;Molecule type: mRNA
A;Residues: 656-719,'QDECLMGAHDCSRRQFCVNTLGSFYCVNHTVLCADGYILNAHRKCVD',720-853,'T',855-1184 <WAM>
A;Cross-references: EMBL:AL050095
A;Experimental source: adult uterus; clone DKFZp586A1519
C;Genetics:
A;Gene: GDB:FBLN2
A;Cross-references: GDB:293037; OMIM:135821
A;Map position: 3p25-3p24
A;Note: DKFZp586A1519.1
C;Superfamily: fibulin-2; EGF homology; von Willebrand factor type C repeat
homology
C;Keywords: alternative splicing; extracellular matrix
F;1-27/Domain: signal sequence #status predicted <SIG>
F;28-1184/Product: fibulin-2 protein #status predicted <MAT>
F;905-941/Domain: EGF homology <EGF>

Query Match 28.0%; Score 709.5; DB 2; Length 1184; 

Best Local Similarity 34.3%; Pred. No. 5.1e-39; 

Matches 146; Conservative 61; Mismatches 156; Indels 63; Gaps 12; 
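
The report does not define Best Local Similarity. The figures printed for the results in this listing are, however, consistent with Matches divided by the total number of aligned columns (Matches + Conservative + Mismatches + Indels). A minimal Python sketch under that assumption follows; the function names are illustrative and are not part of the search software.

```python
import re

def parse_counts(line):
    """Read 'Name value;' pairs from a 'Matches ...; Gaps ...;' statistics line."""
    return {name: int(value)
            for name, value in re.findall(r"([A-Za-z]+)\s+(\d+);", line)}

def best_local_similarity(counts):
    """Assumed definition: identical columns as a percentage of all aligned columns."""
    columns = sum(counts[key] for key in
                  ("Matches", "Conservative", "Mismatches", "Indels"))
    return 100.0 * counts["Matches"] / columns

line = "Matches 146; Conservative 61; Mismatches 156; Indels 63; Gaps 12;"
counts = parse_counts(line)
print(round(best_local_similarity(counts), 1))  # 34.3, as printed for RESULT 6
```

The same calculation gives 34.9 for the counts listed under RESULT 7, which agrees with the value printed there.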

Qy 15 LCLPSPGN AQAQCTNGFDLDRQSGQCLDIDECRTI PEACRGDMMCVNQNGGYLCI P 70 

II : I I H ' I I hlhll - IN ; I ill 

Db 780 LCQNTKGS F YCQARQRCMDGF - LQDPEGNCVD I NE CTSLS E PCR PGFS C I NTVGS YTC - - 836 

Qy 71 RTNPVYRGPYSNPYSTPYSGPYPAAAPPLSAPNYPTISRPLICRFGYQ-MDESNQCVDVD 129

Mil II h HIM: 
Db 837 QRNPLI CARGYHASDDGAKCVDVN 860 

Qy 130 ECATDSHQCNPTQI CINTEGGYTCSCTDGYW- -LLEGQCLDI DECRYG YCQQLCANV 184 

II I hi hi I I II I h hh I II I I 

Db 861 ECETGVHRCGEGQVCHNLPGSYRCDCKAGFQRDAFGRGCIDVNECWASPGRLCQHTCENT 920 

Qy 185 PGSYSCTCNPGFTLNEDGRSCQDVNECATENPCVQTCVNTYGSFICRCDPGYELEEDGVH 244 

III hi II I Ih hlllll : I I I I IT I I hi III 
Db 921 LGS YRCS CASGFLLAADGKRCEDVNECEAQR- CSQECANI YGS YQCYCRQG YQLAEDGHT 979 

Qy 245 CSDMDECS-FSEFLCQHECVNQPGTYFCSCP-PGYILLDDNRSCQDINECEHRNHTCNLQ 302 

hhllh : II hi Ihl hll II : : llhl-ll I h 

Db 980 CTDIDECAQGAGILCTFRCLNVPGSYQCACPEQGYTMTANGRSCKDVDECALGTHNCSEA 1039

Qy 303 QTCYNLQGGFKCIDPIRCEEPYLRISDNRCMCPAENPGCRD QPFTILYRDMDVV 356

1 = 1= I h-l =1 I I I I I = - 

Db 1040 ETCHNIQGSFRCL-RFECPPNYVQVSKTKC ERTTCHDFLECQNSPARITHYQLNFQ 1094

Qy 357 SGRSVPADI FQMQATTRYPGAYYIFQI KSGNEGREFYMRQTGPI SATLVMTRPI KG PRE I 416 

= | III Ih: : I I III I h = = I = 11 = 

Db 1095 TGLLVPAHI FRI GPAPAFTGDTI ALNI I KGNE EGYFGTRRLNAYTG WYLQRAVLE PRDF 1154 

Qy 417 QLDLEM 422 

ihll 

Db 1155 ALDVEM 1160 



RESULT 7 
S34968 

fibulin, splice form D precursor - mouse 

N;Alternate names: basement-membrane protein BM-90; calcium-binding protein BM-90

C;Species: Mus musculus (house mouse)

C;Date: 10-Dec-1993 #sequence_revision 10-Nov-1995 #text_change 02-Aug-2002
C;Accession: S34968; S36441; S13814 

R;Pan, T.C.; Kluge, M.; Zhang, R.Z.; Mayer, U.; Timpl, R.; Chu, M.L.
Eur. J. Biochem. 215, 733-740, 1993 

A;Title: Sequence of extracellular mouse protein BM-90/fibulin and its calcium-
dependent binding to other basement-membrane ligands.
A;Reference number: S34968; MUID: 93358897; PMID: 8354280
A;Accession: S34968 
A;Molecule type: mRNA 
A;Residues: 1-705 <PAN> 



R;Pan, T.C.; Kluge, M.; Zhang, R.Z.; Mayer, U. ; Timpl, R . ; Chu, M.L. 
submitted to the EMBL Data Library, January 1993 

A;Description: Sequence of extracellular mouse protein BM-90/fibulin and its

calcium-dependent binding to other basement membrane ligands. 

A;Reference number: S36440 

A; Accession: S36441 

A;Molecule type: mRNA 

A;Residues: 1-39,'P',41-705 <PAW>

A;Cross-references: EMBL:X70854; NID:g396820; PIDN:CAA50207.1; PID:g396821
A; Experimental source: cell -line F9 teratocarcinoma 
R;Kluge, M.; Mann, K.; Dziadek, M.; Timpl, R.
Eur. J. Biochem. 193, 651-659, 1990 

A; Title: Characterization of a novel calcium-binding 90-kDa glycoprotein (BM-90) 
shared by basement membranes and serum. 

A;Reference number: S13814; MUID: 91065369 ; PMID: 2249686 
A;Accession: S13814 
A;Molecule type: protein 

A;Residues: 28;31-49,'X',51-53;'XX',110-117;231-240,'X',242-243;339-362,'S',364-
387;434-439;469-476;'Q',554-557,'Q',559-563;574-581 <KLU>
C;Superfamily: fibulin-1; EGF homology

C;Keywords: alternative splicing; basement membrane; calcium binding; 

extracellular matrix; glycoprotein; plasma 

F;1-29/Domain: signal sequence #status predicted <SIG>

F;30-705/Product: fibulin, splice form D #status predicted <MAT>

F;98, 537, 541/Binding site: carbohydrate (Asn) (covalent) #status predicted 

Query Match 27.8%; Score 703.5; DB 2; Length 705; 

Best Local Similarity 34.9%; Pred. No. 7.9e-39; 

Matches 162; Conservative 67; Mismatches 156; Indels 79; Gaps 22; 

Qy 16 CLPSPGNAQAQCTNGFDLDRQSGQCLDIDECRTIPEACRGDMMCVNQNGGYLCIPRTNPV 75

II II =11 I I hlhll : hi I I I 
Db 290 CRP KLQCKSGF I QD - ALGNC I D I NE CLS I SAPCPVGQTC I NTEGS YTC 336 

Qy 76 YRGPYSNPYSTPYSGPYPAAAPPLSAPNYPTI SRPLI CRFGYQM-DESNQCVDVDECATD 134 

•■ M II -. UN I 

Db 337 QKNVPN CGRGYHLNE EGTRCVDVDE CAPP 365 

Qy 135 SHQCNPTQI CINTEGGYTCSCTDGYWL- -LEGQCLDIDEC-RY- -GYCQQLCANVPGSYS 189 

= I | : | | |:: : |:[|:|! j| | | | |||: 

Db 366 AEPCGKGHHCLNSPGSFRCECKAGFYFDGISRTCVDINECQRYPGRLCGHKCENTPGSFH 425 

Qy 190 CTCNPGFTLNEDGRSCQDVNECATENPCVQTCVNTYGSFICRCDPGYELEE-DGVHCSDM 248 

hh II l llllhlllll : I I I I I llh I I Ihl : III I 
Db 426 CSCSAGFRLSVDGRSCEDVNEC-LNSPCSQECANVYGSYQCYCRRGYQLSDVDGVTCEDI 484 

Qy 249 DECSF--SEFLCQHECVNQPGTYFCSCP-PGYILLDDNRSCQDINECEHRNHTCNLQQTC 305 

llh , : hi lh = I I I I II I : hlllhll I h = : I I 

Db 485 DECALPTGGH ICSYRCINI PGS FQCS CPS SG YRLAPNGRNCQD I DE CVTG I HNCS I NETC 544 

Qy 306 YNLQGGF KC I DP I RCEE PYLR I S DN RCM- -CPAENPGC-RDQPFTILYRDM 353 

= 1 = 11 hh i I I I : I Ih I = I [| h : : 

Db 545 FN I QGSFRCLS - FECPENYRRSADTFRQEKTDTVRCI KSCRPNDEACVRDPVHTVSHTVI 603 

Qy 354 DVVSGRSV--PADIFQMQATT-RYPG--AYYIFQIKSGNEGREFYM RQTGPISATLV 405

= = I I : I - I I II I III II I , 

Db 604 SLPTFREFTRPEEI IFLRAVTPLYPANQADI IFDITEGNLRDSFDI IKRYEDGMTVGWR 663 



Qy 406 MTRPIKGPREIQLDLEM-ITVNTVINFRGSSVIRLRIYVSQYPF 448

III II I III = i = = I =h = 1 = 11 = 1 I 

Db 664 QWPIVGPFYAVLKLEMNYVLGGWSHR--NVWVHIFVSEYWF 705 



RESULT 8 
B36346 

fibulin 1 precursor, splice form B - human 
C;Species: Homo sapiens (man) 

C;Date: 19-Apr-1991 #sequence_revision 19-Apr-1991 #text_change 02-Aug-2002
C;Accession: B36346

R;Argraves, W.S.; Tran, H. ; Burgess, W.H.; Dickerson, K. 
J. Cell Biol. 111, 3155-3164, 1990

A; Title: Fibulin is an extracellular matrix and plasma glycoprotein with 
repeated domain structure. 

A;Reference number: A36346; MUID: 91100426; PMID: 2269669
A;Accession: B36346
A; Molecule type: mRNA 
A;Residues: 1-601 <ARG> 

A;Cross-references: GB:X53742; NID:g31416; PIDN:CAA37771.1; PID:g31417
C;Genetics:

A;Gene: GDB:FBLN1; FBLN

A;Cross-references: GDB:278285; OMIM:135820

A;Map position: 22q13.3-22q13.3

C;Superfamily: fibulin-1; EGF homology

C;Keywords: alternative splicing 

F; 180-214/Domain: EGF homology <EGF1> 

F;485-523/Domain: EGF homology <EGF> 

Query Match 24.4%; Score 619; DB 2; Length 601; 

Best Local Similarity 37.7%; Pred. No. 2.3e-33; 

Matches 121; Conservative 47; Mismatches 97; Indels 56; Gaps 13; 

Qy 16 CLPSPGNAQAQCTNGFDLDRQSGQCLDIDECRTI PEACRGDMMCVNQNGGYLCI PRTNPV 75 

M : I I = I I I I hlhll :| I hi I I I 
Db 288 CRP KLQCKSGF I QD - ALGNCI DI NECLS I SAPCP I GHTCINTEGS YTC 334 

Qy 76 YRGPYSNPYSTPYSGPYPAAAPPLSAPNYPTISRPLICRFGYQM-DESNQCVDVDECATD 134 

= 11 i II : :| : I j j I [ | f j 

Db 335 QKNVPN CGRG YHLNE EGTRCVD VDE CAP P 363 

Qy 135 SHQCNPTQI CINTEGGYTCSCTDGYWL- -LEGQCLDIDEC-RY- -GYCQQLCANVPGSYS 189 

= 1 hl= I = I I Ih : I : I = = I I II I I I III 

Db 364 AE P CGKGHR CVNS PGS FRCECKTG YYFDG I S RMCVDVNECQR YPGRLCGHKCENTLGS YL 423 

Qy 190 CTCNPGFTLNEDGRSCQDVNECATENPCVQTCVNTYGSFICRCDPGYELEE-DGVHCSDM 248 

M- || h li I ■'■ , : : [ | M I Mh I I I I '= I : I I , 
Db 424 CS CS VGFRLS VDGRS CED I NECSS - S P CS QE CANVYGS YQCYCRRGYQLSDVDGVTCED I 482 

Qy 249 DE CS F - - S EFLCQHECVNQ PGTYF CS CP - PG Y I LLDDNRS CQD I NECEHRNHTCNLQQTC 305 

111= =1 = 1 = 1 II- Mil II I = hlllhll | h= :|l 

Db 483 DECALPTGGHICSYRCINI PGSFQCSCPSSGYRIAPNGRNCQDI DECVTGI HNCS INETC 542 

Qy 306 YNLQGGFKCIDPIRCEEPYLR 326 

= 1 = 11 hh I I I I 

Db 543 FNIQGAFRCL-AFECPENYRR 562 



RESULT 9 
T42760 

fibulin, splice form D precursor - Caenorhabditis elegans 
C;Species: Caenorhabditis elegans 

C;Date: 11-Jan-2000 #sequence_revision 11-Jan-2000 #text_change 23-Sep-2002
C;Accession: T42760

R;Barth, J.L.; Argraves, K.M.; Roark, E.F.; Little, C.D.; Argraves, W.S.
submitted to the EMBL Data Library, February 1998 

A; Description: Isolation of chicken and nematode fibulin-1 homologs and 
characterization of the nematode fibulin-1 gene. 
A; Reference number: Z22267 
A;Accession: T42760 

A;Status: preliminary; translated from GB/EMBL/DDBJ 
A;Molecule type: mRNA 
A;Residues: 1-689 <BAR> 

A;Cross-references: EMBL:AF051401; PIDN:AAC28321.1
C; Genetics: 

A; Note : FBLN1 

Query Match 23.9%; Score 606.5; DB 2; Length 689; 

Best Local Similarity 27.3%; Pred. No. 1.7e-32; 

Matches 159; Conservative 53; Mismatches 163; Indels 207; Gaps 20; 

Qy 15 LCLPSPG-NAQAQCTNGFDLDRQSGQCLDIDECRTIPEACRGDMMCVNQNGGYLCIPRTN 73

II I = I Hill MINI ! I II 
Db 167 LCHDRGGEKVECSCRSGFDLAPDGMACVDIDECATLMDDCLESQRCLNTPGSFKCI 222

Qy 74 PVYRGPYSNPYSTPYSGPYPAAAPPLSAPNYPTI SRPLI CRFGYQMD-ESNQCVDVDECA 132 

III Hill: ' I Mill 
Db 223 RTLSCGTGYAMDSETERCRDVDECN 247 

Qy 133 TDSHQCNPTQICINTEGGYTC SCTDGYWLLEGQCLDIDE 171

II I I I Ihl I 1 =1 :|h I I Mil 

Db 248 LGSHDCGPLYQCRNTQGSYRCDAKKCGDGELQNPMTGECTSITCPNGYYPKNGMCNDIDE 307

Qy 172 CRYGY CQQ LCA N 183 

I h III Ml I 

Db 308 CVTGHNCGAGEECVNTPGSFRCQQKGNLCAHGYEVNGATGFCEDVNECQQGVCGSMECIK 367 

Qy 184 VPGSYSCTCNPGFTLNE 200

•Ihl I I Ih h 

Db 368 LPGTYKCKCGPGYEFNDAKKRCEDVDEC I KFAGHVCDLSAECI NTI GSFECKCKPGFQLA 427 
Qy 201 -DGRSCQDVNECATE-NPCVQTCVNTYGSFICRCDPGYELEEDGVHCSDMDECSF S 254

III MINI I II III , I 1 1 h I II I Mill 

Db 428 SDGRRCEDVNECTTGIAACEQKCVNIPGSYQCICDRGFALGPDGTKCEDIDECSIWAGSG 487 

Qy 255 EFLCQHECVNQ PGTYF CS CP PG Y I LLDDNRS CQD I NECEHRNHTCNLQQTC YNLQGGFKC 314 

II hi hi I I I I I I : I hi ■ I : I I I III 

Db 488 NDLCMGGCINTKGSYLCQCPPGYKIQPDGRTCVDVDECA-MGECAGSDKVCVNTLGSFKC 546 

Qy 315 IDPIRCEEPYLRISDNR • C--MCPAENPGC-RDQPFTILYRDMDVVSGRSV-- 361

M h 1 h I •■! h- I :|h I I 

Db 547 -HSIDCPTNYIHDSLNKNQIADGYSCIKVCSTEDTECLGNHTREVLYQFRAVPSLKTIIS 605 



Qy 362 PADIFQMQATTRYPGAYYIFQIKSGNEGREFYMRQTGPISATLVMTRPIK 411

Db 606 P I EVSRI VTHMGVPFSVDYNLDY VGQRHFRI VQERNI G IVQLVKPIS 652 

Qy 412 GPREIQLDLEMITVNTVINFRGSSVIR LRIYVSQYPF 448

I! Mill I i 11=111 

Db 653 GP TVETIKVNIHTKSRTGVILAFNEAI IEISVSKYPF 689 



RESULT 10 
T42990 

fibulin 1, splice form C precursor - Caenorhabditis elegans 
C;Species: Caenorhabditis elegans 

C;Date: 11-Jan-2000 #sequence_revision 11-Jan-2000 #text_change 08-Sep-2002
C;Accession: T42990

R;Barth, J.L.; Argraves, K.M.; Roark, E.F.; Little, C.D.; Argraves, W.S.
submitted to the EMBL Data Library, February 1998 

A; Description: Isolation of chicken and nematode fibulin-1 homologs and 
characterization of the nematode fibulin-1 gene. 
A; Reference number: Z22267 
A;Accession: T42990 

A;Status: preliminary; translated from GB/EMBL/DDBJ 
A;Molecule type: mRNA
A;Residues: 1-712 <BAR> 

A;Cross-references: EMBL:AF051402; PIDN:AAC28322.1
C; Genetics : 
A; Gene: FBLN1 

C; Super family: fibulin-1; EGF homology 

C;Keywords: alternative splicing; basement membrane; extracellular matrix 

Query Match 23.7%; Score 599.5; DB 2; Length 712; 

Best Local Similarity 27.8%; Pred. No. 5.1e-32;

Matches 155; Conservative 47; Mismatches 153; Indels 203; Gaps 19; 

Qy 15 LCLPSPG-NAQAQCTNGFDLDRQSGQCLDIDECRTI PEACRGDMMCVNQNGGYLCI PRTN 73 

II I : I Mil hlllll 1= : I 1 = 1 I = II 
Db 167 LCHDRGGEKVECS CRSGFDLAPDGMAC VD I DE CATLMDDCLES QRCLNTPGS FKC I 222 

Qy 74 PVYRGPYSNPYSTPYSGPYPAAAPPLSAPNYPTISRPLI CRFGYQMD-ESNQCVDVDECA 132 

III II II h :| HIM 
Db 223 RTLSCGTGYAMDSETERCRDVDECN 247 

Qy 133 TDSHQCNPTQICINTEGGYTC SCTDGYWLLEGQCLDIDE 171 

II I I I Ml I I = 1 :||: I I III! 

Db 248 LGSHDCGPLYQCRNTQGSYRCDAKKCGDGELQNPMTGECTSITCPNGYYPKNGMCNDIDE 307

Qy 172 CRYGY CQQ---LCA N 183 

lh- III III I 

Db 308 CVTGHNCGAGEECVNTPGSFRCQQKGNLCAHGYEVNGATGFCEDVNECQQGVCGSMECIK 367

Qy 184 VPGSYSCTCNPGFTLNE 200

=11=1 I I 11= 1= 

Db 368 LPGTYKCKCGPGYEFNDAKKRCEDVDECI KFAGHVCDLSAECINTI GSFECKCKPGFQLA 427 
Qy 201 -DGRSCQDVNECATE-NPCVQTCVNTYGSFICRCDPGYELEEDGVHCSDMDECSF S 254

III 1 = 11111 I I I III 11= Ml 1= I II I Mill 

Db 428 SDGRRCEDVNECTTGI AACEQKCVNI PGSYQCI CDRGFALGPDGTKCEDIDECSIWAGSG 487 



Qy 255 EFLCQHECVNQPGTYFCSCPPGYILLDDNRSCQDINECEHRNHTCNLQQTCYNLQGGFKC 314

II hi hi. I Hill : I hi l-M =111111 
Db 488 NDLCMGGCINTKGSYLCQCPPGYKIQPDGRTCVDVDECA-MGECAGSDKVCVNTLGSFKC 546

Qy 315 IDPIRCEEPYLRISDNR CMCPAENPGCRDQPFTILYRDMDVVSGRSVP 362

I I h I h I I I I I : h : = hli 

Db 547 -HSI DCPTNYI HDSLNKNRCNRQPSACGLPEE - - - CSKVPLFLTYQFI SL - - ARAVPI SS 600 

Qy 363 ADIFQMQATTRYPGAYYIFQIKSGNEGREFYMRQTGP--ISA 402

II l=.| I II : I II II 

Db 601 HRPAI TLFKVSAPNHADTEVNFELQLKTT I VGAPNVL PA I RAN FLLQKGEKRNSA 655 

Qy 403 TLVMTRPIKGPREIQLDL 420

: : : (I: -| t 
Db 656 WTLRDSLDGPQTVKLQL 673 

RESULT 11 
T43210 

fibulin-1D precursor - Caenorhabditis elegans (fragment)
C;Species: Caenorhabditis elegans

C;Date: 11-Jan-2000 #sequence_revision 11-Jan-2000 #text_change 08-Sep-2002
C;Accession: T43210

R;Barth, J.L.; Argraves, K.M.; Roark, E.F.; Little, C.D.; Argraves, W.S.
submitted to the EMBL Data Library, June 1998 

A; Description: Identification of chicken and C. elegans fibulin-1 homologs and 
characterization of the C. elegans fibulin-1 gene. 
A; Reference number: Z22337 
A;Accession: T43210 

A; Status : preliminary; translated from GB/EMBL/DDBJ 
A; Molecule type: DNA 
A; Residues: 1-589 <BAR> 

A;Cross-references: EMBL:AF070477; PIDN:AAC24035.1
C;Genetics : 

A; Note: intron positions not resolved (incomplete sequence) 
C; Super family: fibulin-1; EGF homology 

Query Match 23.5%; Score 596; DB 2; Length 589; 

Best Local Similarity 27.3%; Pred. No. 7.4e-32; 

Matches 159; Conservative 53; Mismatches 163; Indels 208; Gaps 21; 

Qy 15 LCLPSPG-NAQAQCTNGFDLDRQSGQCLD-IDECRTIPEACRGDMMCVNQNGGYLCIPRT 72

II I ' I =1111 hi llll h : I 1=1 i = II 
Db 66 LCHDRGGEKVECSCRSGFDLAPDGMACVDHIDECATLMDDCLESQRCLNTPGSFKCI 122 

Qy 73 NPVYRGPYSNPYSTPYSGPYPAAAPPLSAPNYPTISRPLICRFGYQMD-ESNQCVDVDEC 131 

III II II h ' Mill 

Db 123 RTLSCGTGYAMDSETERCRDVDEC 146 

Qy 132 ATDSHQCNPTQI CINTEGGYTC SCTDGYWLLEGQCLDID 170 

llll I I, ' I I I I I III 

Db 147 NLGSHDCGPLYQCRNTQGSYRCDAKKCGDGELQNPMTGECTSITCPNGYYPKNGMCNDID 206 

Qy 171 ECRYGY CQQ---LCA 182 

i h III Ml 

Db 207 ECVTGHNCGAGEECVNTPGSFRCQQKGNLCAHGYEVNGATGFCEDVNECQQGVCGSMECI 266



Qy 183 NVPGSYSCTCNPGFTLNE 200

h I ! I I I Ih h 

Db 267 NLPGTYKCKCGPGYEFNDAKKRCEDVDECIKFAGHVCDLSAECINTIGSFECKCKPGFQL 326

Qy 201 --DGRSCQDVNECATE-NPCVQTCVNTYGSFICRCDPGYELEEDGVHCSDMDECSF 253

111 hlllll I i i III ih Ml h I II I Mill 

Db 327 ASDGRRCEDVNECTTGI AACEQKCVNI PGSYQCICDRGFALGPDGTKCEDI DECSIWAGS 386 

Qy 254 SEFLCQHECVNQPGTYFCSCPPGYILLDDNRSCQDINECEHRNHTCNLQQTCYNLQGGFK 313 

II hi hi I Mill : I hi h:|| : M I II 

Db 387 GNDLCMGGCINTKGSYLCQCPPGYKIQPDGRTCVDVDECA-MGECAGSDKVCVNTLGSFK 445 

Qy 314 CIDPIRCEEPYLRISDNR C--MCPAENPGC-RDQPFTILYRDMDVVSGRSV- 361

I M h I h I H h I : Oh I I 

Db 446 C-HSIDCPTNYIHDSLNKNQIADGYSCIKVCSTEDTECLGNHTREVLYQFRAVPSLKTII 504 

Qy 362 -PADI FQMQATTRY PGAYY I FQI KSGNEGREF YMRQTGP I S ATLVMTRP I 410 

| :: | : | | : | j : : :j| 

Db 505 SPI EVSRI VTHMGVPFSVDYNLDYVGQRHFRI VQERNI G IVQLVKPI 551 

Qy 411 KG PRE I QLDLEM I TVNTVI NFRGS S VI R LRI YVSQYPF 448 

II :| I II I : I Ihlll 

Db 552 SGP TVETIKVNIHTKSRTGVILAFNEAIIEISVSKYPF 589



RESULT 12 
T22793 

hypothetical protein F56H11.1 - Caenorhabditis elegans 
C; Species: Caenorhabditis elegans 

C;Date: 15-Oct-1999 #sequence_revision 15-Oct-1999 #text_change 02-Aug-2002
C;Accession: T22793; T24489 
R; Wilkinson, J. 

submitted to the EMBL Data Library, January 1996 
A; Reference number: Z19616 
A; Accession: T22793 

A; Status : preliminary; translated from GB/EMBL/DDBJ 
A; Molecule type: DNA 
A; Residues: 1-798 <WIL> 

A;Cross-references: EMBL:Z68749; PIDN:CAA92962.1; GSPDB:GM00022; CESP:F56H11.1
A; Experimental source: clone F56H11 
R; Lloyd, C. 

submitted to the EMBL Data Library, December 1995 
A;Reference number: Z19897
A;Accession: T24489

A; Status : preliminary; translated from GB/EMBL/DDBJ 
A; Molecule type: DNA 
A;Residues: 1-798 <WI2> 

A;Cross-references: EMBL:Z68219; PIDN:CAA92483.1; GSPDB:GM00022; CESP:F56H11.1

A; Experimental source: clone T05A1 

C; Genetics: 

A; Gene: CESP: F56H11 . 1 

A; Map position: 4 

A;Introns: 14/1; 92/3; 144/1; 195/1; 281/1; 325/2; 371/2; 390/1; 437/1; 471/3;
498/3; 607/1; 649/2; 718/1 
C;Superfamily : fibulin-1; EGF homology 



Query Match 22.8%; Score 577.5; DB 2; Length 798; 

Best Local Similarity 26.7%; Pred. No. 1.6e-30; 

Matches 162; Conservative 50; Mismatches 180; Indels 215; Gaps 23; 

Qy 15 LCLPSPG-NAQAQCTNGFDLDRQSGQCLDIDECRTIPEACRGDMMCVNQNGGYLC 68 

II I = I ONI hi :|| I I III llhl 

Db 167 LCHDRGGEKVECSCRSGFDLAPDGMACVDRNECLTRQSPCTQSEDCVNTIGGYICQRRIS 226 

Qy 69 - - IP RTNFPVYRGP YSNPYSTPYSGPYPAAAPP 98 

- I I I ■ i 'III I I I • 

Db 227 RLVPHRHRANRIGNAPRRMRDDPYSR--AGEYREASQANTEFGCPMGWLFQHGHCVDIDE 284 

Qy 99 LSAPNYPTISRPLI CRFGYQMD ESNQCV 126

hi I I I II II 111 

Db 285 CATLMDDCLESQRCLNTPGSFKCI RTLSCGTGYAMDSETERNNCFLI ILNNTFNCKYFFV 344 

Qy 127 -DVDECATDSHQCNPTQI CINTEGGYTCS 154 

Hill II I I I Ihl I I 
Db 345 EDVDECNLGSHDCGPLYQCRNTQGSYRCDAKKCGDGELQNPMTGEYIDECVTGHNCGAGE 404 

Qy 155 CTDG YWL - - LEGQCLDI DECRYG YCQQL - CANVPGS YS CTCNP 194 

I, || I- ', I [ : I hlhl I I I 

Db 405 ECVNTPGSFRCQQKGNLCAHGYEVNGATGFCEDVNECQQGVCGSMECINLPGTYKCKCGP 464 

Qy 195 GFTLNE DGRSCQDVNE 210 

h h 111 hllii 

Db 465 GYEFNDAKKRCEDVDECIKFAGHVCDLSAECINTIGSFECKCKPGFQLASDGRRCEDVNE 524 

Qy 211 CATE - N PCVQTCVNT YGS F I CRCD PG YELEEDGVHCSDMDECS F SEFLCQHECVNQ 265 

II I Mil Ih I II h I II I hllll II hi 

Db 525 CTTGIAACEQKCVNI PGSYQCI CDRGFALGPDGTKCEDIDECSIWAGSGNDLCMGGCINT 584 

Qy 266 PGTYFCSCPPGYILLDDNRSCQDINECEHRNHTCNLQQTCYNLQGGFKCIDPIRCEEPYL 325 

hi I ' : I hi I ■ i =11 I Ml II 

Db 585 KGSYLCQCPPGYKIQPDGRTCVDVDECA-MGECAGSDKVCVNTLGSFKC-HSIDCPTNYI 642 

Qy 326 RISDNR CMCPAENPGCRDQPFTILYRDMDVVSGRSVP 362

I h III I I : h : : hll 
Db 643 HDS LNKNRCNRQPSACGL PEE CSKVPLFLT YQF I SL - - ARAVP I SSHR PA I TLFKVS 697 

Qy 363 ADIFQMQATTRYPGAYYIFQIKSGNEGREFYMRQTGP--ISATLVMTRPIKGP 413

II |::| ,1 II : I :: II I I : : li 

Db 698 APNHADTEVNFELQLKTTIVGAPNVLPAIRAN FLLQKGEKRNSAWTLRDSLDGP 752 

Qy 414 REIQLDL 420

: ::| | 
Db 753 QTVKLQL 759 



RESULT 13 
A35626 

transforming growth factor beta-1-binding protein - human
C;Species: Homo sapiens (man)

C;Date: 21-Sep-1990 #sequence_revision 21-Sep-1990 #text_change 11-Jan-2000
C;Accession: A35626

R;Kanzaki, T.; Olofsson, A.; Moren, A.; Wernstedt, C.; Hellman, U.; Miyazono,
K.; Claesson-Welsh, L.; Heldin, C.H.
Cell 61, 1051-1061, 1990
A;Title: TGF-beta1 binding protein: a component of the large latent complex of
TGF-beta1 with multiple repeat sequences.

A;Reference number: A35626; MUID: 90275601; PMID: 2350783 

A;Accession: A35626 

A; Status: preliminary; not compared with conceptual translation 
A; Molecule type: mRNA 
A; Residues: 1-1394 <KAN> 

A; Cross-references : GB:M34057 ; NID:g339547; PIDN: AAA61160 . 1; PID:g339548 
C;Superfamily: unassigned EGF-related proteins; EGF homology
C; Keywords: alternative splicing 
F;750-791/Domain: EGF homology <EGF> 



Query Match 21.5%; Score 544; DB 2; Length 1394;

Best Local Similarity 29.4%; Pred. No. 4e-28; 

Matches 128; Conservative 52; Mismatches 134; Indels 122; Gaps 17; 



Qy 16 CLPSPGNAQAQCTNGFDLDRQSGQCLDIDECRTIPEACRGDMMCVNQNGGYLCIPRTNPV 75

hi ||: i Mllill =1 Ml 'III

Db 562 CINLPVRYTCICYEGYRFSEQQRKCVDIDECTQVQHLC-SQGRCENTEGSFLCI 614

Qy 76 YRGPYSNPYSTPYSGPYPAAAPPLSAPNYPTISRPLICRFGYQ 118

: |: : = || = |||:

Db 615 CPAGFMAS EEGTN CI DVDECLRPDVCGEGHCVNTVGAFRCEYCDSGYR 662

Qy 119 MDESNQCVDVDECATDSHQCNPTQICINTEGGYTC-SCTDGYWLLEGQCLDIDEC-RYGY 176

1 : : 1 hill 1 1 h hh 1 1 1 Ihh llllhlll

Db 663 MTQRGRCEDIDECLNPS -TC- PDEQCVNSPGSYQCVPCTEGFRGWNGQCLDVDECLEPNV 720

Qy 177 CQQ-LCANVPGSYSCTCNPGFTLNEDGRSCQDVNECATENPCVQ 219

1 hh III hh hi 1 : hMII I 11

Db 721 CANGDCSNLEGSYMCSCHKGYTRTPDHKHCRDIDECQQGNLCVNGQCKNTEGSFRCTCGQ 780

Qy 220 TCVNTYGSFICRCDPGYELEEDGVHCSDMDECSF 253

MM IN II 11 1 II Mil

Db 781 GYQLSAAKDQCEDIDECQHRHLCAHGQCRNTEGSFQCVCDQGYRASGLGDHCEDINECLE 840

Qy 254 SEFLCQH-ECVNQPGTYFCSCPPGYILLDDNRSCQDINECEHRNHTCNLQQTCYNLQGGF 312

= =11 Ml hi hll h 1 llh: II II IN II 1 1 1 Ml 1

Db 841 DKSVCQRGDCINTAGSYDCTCPDGF-QLDDNKTCQDINECEHPG-LCGPQGECLNTEGSF 898

Qy 313 KCIDPIRCEEPYLRISDNRCMCPAENPGCR 342

1= 1 Ih - II Ihl 1 = II

Db 899 HCVCQQGFS I SADGRTCEDI DECVNNTVCDSHGF CDNTAGS FRCLCYQGFQAPQDGQGCV 958

Qy 343 DQPFTILYRDMDVVSG 358

1 = -HI

Db 959 D VNECELLSG 968





RESULT 14 
A47221 

fibrillin 1 precursor - human (fragment) 
C; Species: Homo sapiens (man) 

C;Date: 02-Jun-1995 #sequence_revision 25-Apr-1997 #text_change 02-Aug-2002 
C;Accession: A47221; I54355; S17064; I59574; S17062; S62111; A34198
R;Corson, G.M.; Chalberg, S.C.; Dietz, H.C.; Charbonneau, N.L.; Sakai, L.Y.



Genomics 17, 476-484, 1993 

A;Title: Fibrillin binds calcium and is coded by cDNAs that reveal a multidomain 
structure and alternatively spliced exons at the 5' end.
A;Reference number: A47221; MUID: 94010947; PMID: 7691719
A; Accession: A47221 

A; Molecule type: mRNA 

A;Residues: 1-337,'T',339-1029 <COR>

A;Cross-references: GB:X63556

R;Pereira, L.V. ; D'Alessio, M . ; Ramirez, F.; Lynch, J.; Sykes, B.; Pangilinan, 
T. ; Bonadio, J. 

Hum. Mol. Genet. 2, 961-968, 1993 

A; Title: Genomic organization of the sequence coding for fibrillin, the 

defective gene product in Marfan syndrome.

A;Reference number: I54355; MUID: 93372860; PMID: 8364578

A;Accession: I54355

A;Status: preliminary; translated from GB/EMBL/DDBJ

A; Molecule type: mRNA 

A; Residues: 132-3002 <PER> 

A;Cross-references: GB:L13923; NID:g306745; PIDN:AAB02036.1; PID:g306746
R;Maslen, C.L.; Corson, G.M.; Maddox, B.K.; Glanville, R.W.; Sakai, L.Y.
Nature 352, 334-337, 1991 

A; Title: Partial sequence of a candidate gene for the Marfan syndrome. 

A; Reference number: S17064; MUID : 91304568 ; PMID: 1852207 

A; Accession: S17064 

A; Molecule type: mRNA 

A;Residues: 1030-3002 <MAS> 

A; Cross-references : EMBL:X63556 

R;Dietz, H.C.; Valle, D. ; Francomano, C.A. ; Kendzior, R.J. 
Science 259, 680-683, 1993

A; Title: The skipping of constitutive exons in vivo induced by nonsense 
mutations. 

A;Reference number: I59574; MUID: 93157831; PMID: 8430317
A;Accession: I59574

A; Status: preliminary; translated from GB/EMBL/DDBJ 
A; Molecule type: DNA 

A;Residues: 2217-2288 , 1 1 *, 2290-2325 <RES> 

A;Cross-references: GB:S54426; NID:g264860; PIDN:AAB25244.1; PID:g264861
R;Lee, B.; Godfrey, M.; Vitale, E.; Hori, H.; Mattei, M.G.; Sarfarazi, M.;
Tsipouras, P.; Ramirez, F.; Hollister, D.W.
Nature 352, 330-334, 1991 

A; Title: Linkage of Marfan syndrome and a phenotypically related disorder to two 
different fibrillin genes. 

A;Reference number: S17062; MUID: 91304567; PMID: 1852206 
A;Accession: S17062 
A; Molecule type: mRNA 

A; Residues: ' VLVTWFIFLSYNKML ' , 944-1444 <LEE1> 

A;Cross-references: EMBL:X62008; NID:g31398; PIDN:CAB56534.1; PID:g5924015

A;Accession: S62111

A; Molecule type: protein 

A;Residues: 1166-1176, 'X ', 1178-1180, 'D ', 1182-1185 <LEE2> 
R;Maddox, B.K.; Sakai, L.Y.; Keene, D.R.; Glanville, R.W.
J. Biol. Chem. 264, 21381-21385, 1989 

A;Title: Connective tissue microfibrils. Isolation and characterization of three 

large pepsin-resistant domains of fibrillin. 

A;Reference number: A34198; MUID: 90078246; PMID: 2512293 

A;Accession: A34198 

A; Molecule type: protein 



A;Residues: 565-575; 1890-1892, ' I 1894-1900 <MAD> 

C;Comment: Fibrillin is a major component of elastin-associated microfibrils. 

C;Genetics:

A;Gene: GDB:FBN1

A;Cross-references: GDB:127115; OMIM:134797; OMIM:154700

A;Map position: 15q21.1-15q21.1

A;Introns: 2236/1; 2258/1; 2297/1 

C; Super family: fibrillin 1; EGF homology 

C; Keywords: alternative splicing; calcium binding; extracellular matrix; 
glycoprotein; Marfan syndrome

F;1-3002/Product: fibrillin (5'-region exon A splice form) (fragment) #status
predicted <MATA> 

F;132-3002/Product: fibrillin (5 '-region exon C splice form) #status predicted 
<MATC> 

F;1332-1367/Domain: EGF homology <EGF> 
F;1457-1492/Domain: EGF homology <EGF2>
F;2262-2295/Domain: EGF homology <EGF1> 

Query Match 21.0%; Score 532.5; DB 2; Length 3002; 

Best Local Similarity 30.1%; Pred. No. 4.5e-27; 

Matches 124; Conservative 50; Mismatches 125; Indels 113; Gaps 18; 

Qy 16 CLPSPGNAQAQCTNGFDLDRQSGQCLDIDECRTIPEACRGDMMCVNQNGGYLCIPRTNPV 75

I : 1= = M Ml II : i illlll h II 111 1*1 : = 
Db 1175 CRNTIGSFKCRCDSGFALDSEERNCTDIDECRISPDLC-GRGQCVNTPGDFEC- -KCDEG 1231 

Qy 76 YRGPYSNPYSTPYSGPYPAAAPPLSAPNYPTISR PLICRF 115 

1 = : I I Ihll 

Db 1232 YESGF MMMKNCMDIDECQRDPLLCRGGVCHNTEGSYRCECPP 1273 

Qy 116 GYQMDES -NQCVDVDECATDSHQCNPTQI CINTEGGYTCSCTDGYWLLEGQ - - CLDI DEC 172 

hh : : h|::|l =111 hi II hi II = hlllll 

Db 1274 GHQLSPNI SACIDINECELSAHLC- PNGRCVNLIGKYQCACNPGYHSTPDRLFCVDIDEC 1332 

Qy 173 R Y - - G YCQQLCANVPGS YS CTCNPGFTLNEDGRS CQDVNECATENP 216 

II-. II Ml hi III I I III hMI Ml 

Db 1333 SIMNGGCETFCTNSEGSYECSCQPGFALMPDQRSCTDIDEC-EDNPNICDGGQCTNIPGE 1391 

Qy 217 CVQ TCVNTYGSFI CRCDPGYELEEDGVHC 245 

II II II Hill II II I 

Db 1392 YRCLCYDGFMASEDMKTCVDVNECDLNPN I CLSGTCENTKGSFI CHCDMGYSGKKGKTGC 1451

Qy 246 SDMDECSFS E FLC - QHE - CVNQPGTYFCS CPPG Y I LLDDNRS CQDI NECEHRNHTCNLQQ 303 

M = MI I M M h: Ml MM I 1 hMI = I 1 = 

Db 1452 TD I NECE I GAHNCGKHA VCTNTAGS FKCS CS PGW I - -GDGI KCTDLDECSNGTHMCSQHA 1509 
Qy 304 TCYNLQGGFKCIDPIRCEEPYL RISDNRCMCPAENPGCRDQP 345

I I I ■■■■]■ hi I hi M I hi 

Db 1510 DCKNTMGSYRCL CKEG YTGDGFTCTDLDECS ENLNLC - - GNGQCLNAP 1555 



RESULT 15 
A55567 

fibrillin I - bovine 

C; Species: Bos primigenius taurus (cattle) 

C;Date: 20-Feb-1995 #sequence_revision 20-Feb-1995 #text_change 02-Aug-2002 
C;Accession: A55567 



R;Tilstra, D.J.; Li, L. ; Potter, K.A. ; Womack, J.; Byers, P.H. 
Genomics 23, 480-485, 1994 

A; Title: Sequence of the coding region of the bovine fibrillin cDNA and 
localization to bovine chromosome 10.

A;Reference number: A55567; MUID: 95137597; PMID: 7835900
A;Accession: A55567
A;Status: preliminary
A;Molecule type: mRNA

A;Residues: 1-2871 <TIL>

A; Cross-references: GB:L28748; NID : g508427; PIDN: AAA74122 . 1 ; PID:g508428 
C;Superfamily: fibrillin 1; EGF homology
F;1201-1236/Domain: EGF homology <EGF>

Query Match 21.0%; Score 531.5; DB 2; Length 2871;

Best Local Similarity 29.6%; Pred. No. 5e-27; 

Matches 122; Conservative 48; Mismatches 129; Indels 113; Gaps 16; 

Qy 16 CLPSPGNAQAQCTNGFDLDRQSGQCLDIDECRTIPEACRGDMMCVNQNGGYLCIPRTNPV 75

I •■ h ■• H :|| II : I II I III I- I I III I I 
Db 1044 CRNTIGSFKCRCDSGFALDSEERNCTDIDECRISPDLC-GRGQCVNTPGDFEC--KCDEG 1100 

Qy 76 YRGPYSNPYSTPYSGPYPAAAPPLSAPNYPTISR PLICRFGYQMDES 122

I = =11 IMI I 

Db 1101 YESGF -MMMKNCMDIDECQRDPLLCRGGVCLNTEGSYRCECPP 1142 

Qy 123 NQCVDVDECATDSHQCNPTQICINTEGGYTCSCTDGYWLLEGQ--CLDIDEC 172 

= hhW'l :| I I 1 = 1 I I 1 = 1 II : MINI 

Db 1143 GHQLAPNISACIDINECELSAHLC-PHGRCVNLIGKYQCACNPGYHSTPDRLFCVDIDEC 1201 

Qy 173 RY--GYCQQLCANVPGSYSCTCNPGFTLNEDGRSCQDVNECATENP 216 

11= I I Ml 1=1 III I I III l==ll =11 
Db 1202 S I MNGGCETFCTNSEGS YECSCQPGFALMPDQRS CTDI DEC - EDNPNI CDGGQCTNI PGE 1260 

Qy 217 CVQ TCVNTYGSF I CRCDPGYELEEDGVHC 245 

II . II II Mill 1111== I 

Db 1261 YRCLCYDGFMASEDMKTCVDVNECDLNPN I CLSGTCENTKGSFI CHCDMGYSGKKGKTGC 1320 

Qy 246 SDMDECSFSEFLCQHE--CVNQPGTYFCSCPPGYILLDDNRSCQDINECEHRNHTCNLQQ 303 

= l = = il I I I l== Ill Ihl I I h = ll *• I |: 

Db 1321 TDINECEIGAHNCDRHAVCTNTAGSFKCSCSPGWI - -GDGIKCTDLDECSNGTHMCSQHA 1378 

Qy 304 TCYNLQGGFKCI DPI RCEEPYL RI SDNRCMCPAENPGCRDQP 345 

I I I ==1= 1=1 I hi = 11 1=1 

Db 1379 DCKNTMGSYRCL CKEGYTGDGFTCTDLDECSENLNLC--GNGQCLNAP 1424



Search completed: January 9, 2004, 12:34:53 
Job time : 19.9736 secs
Perfect score:
Sequence:

Scoring table:
(without alignments)
Minimum DB seq length: 0
Post-processing: Minimum Match 0%
Listing first 45 summaries



127863 



Database 



Swiss Prot 41:* 



Pred. No. is the number of results predicted by chance to have a
score greater than or equal to the score of the result being printed, 
and is derived by analysis of the total score distribution. 
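
The Query Match column of the summaries is not defined in the report either. The values listed are consistent with the hit score expressed as a percentage of the perfect score (2533 for this query), rounded to one decimal place. A short Python sketch under that assumption; `query_match_percent` is an illustrative name, not a function of the search software.

```python
def query_match_percent(score, perfect_score):
    """Assumed definition: hit score as a percentage of the query's perfect score."""
    return round(100.0 * score / perfect_score, 1)

PERFECT_SCORE = 2533  # perfect score reported for this query

# Scores taken from the summaries: the self-hit and two lower-ranked hits.
for score in (2533, 709.5, 544):
    print(score, query_match_percent(score, PERFECT_SCORE))
```

Under this reading, a Query Match of 100.0 simply marks a database entry that reproduces the query's own self-alignment score.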

SUMMARIES 



Result                   Query
   No.       Score       Match      Length          DB  ID           Description
     1        2533       100.0         448           1  FBL5_HUMAN   Q9ubx5 homo sapien
     2        2406        95.0         448           1  FBL5_RAT     Q9wh8 rattus norv
     3        2405        94.9         448           1  FBL5_MOUSE   Q9wvh9 mus musculu
     4        1290        50.9         443           1  FBL4_CRIGR   O55058 cricetulus
     5        1289        50.9         443           1  FBL4_HUMAN   O95967 homo sapien
     6        1282        50.6         443           1  FBL4_MOUSE   Q9wvj9 mus musculu
     7      1109.5        43.8         493           1  FBL3_RAT     O35568 rattus norv
     8        1101        43.5         493           1  FBL3_HUMAN   Q12805 homo sapien
     9       733.5        29.0        1221           1  FBL2_MOUSE   P37889 mus musculu
    10         728        28.7         598           1  FBL1_CERAE   Q8mjj9 cercopithec
    11         722        28.5         704           1  FBL1_CHICK   O73775 gallus gall
    12       709.5        28.0        1184           1  FBL2_HUMAN   P98095 homo sapien
    13       701.5        27.7         703           1  FBL1_HUMAN   P23142 homo sapien
    14       700.5        27.7         705           1  FBL1_MOUSE   Q08879 mus musculu
    15         631        24.9         681           1  FBL1_BRARE   O42182 brachydanio
    16       577.5        22.8         798           1  FBL1_CAEEL   O77469 caenorhabdi
    17         544        21.5        1394           1  LTBS_HUMAN   P22064 homo sapien
    18         544        21.5        1595           1  LTBL_HUMAN   Q14766 homo sapien
    19       532.5        21.0        2871           1  FBN1_HUMAN   P35555 homo sapien
    20       531.5        21.0        2871           1  FBN1_BOVIN   P98133 bos taurus
    21       525.5        20.7        2871           1  FBN1_PIG     Q9tv36 sus scrofa
    22         523        20.6        1712           1  LTB1_RAT     Q00918 rattus norv
    23         521        20.6        1389           1  LTBS_MOUSE   Q8cgl8 mus musculu
    24         521 [illegible]        1713           1  [illegible]  Q8cgl9 mus musculu
    25       518.5        20.5        2871           1  [illegible]  Q61554 mus musculu
    26       518.5        20.5        2907           1  FBN2_MOUSE   Q61555 mus musculu
    27       517.5        20.?        2911           1  [illegible]  P35556 homo sapien
    28         475        18.8         956           1  [illegible]  [illegible] homo sapien
    29         452        17.8         956           1  [illegible]  [illegible] mus musculu
    30         421        16.6         931           1  [illegible]  Q61549 mus musculu
    31         392        15.5         886           1  EMR1_HUMAN   Q14246 homo sapien
    32         379        15.0        2470           1  NTC2_MOUSE   O35516 mus musculu
    33         377 [illegible] [illegible]           1  [illegible]  [illegible] homo sapien
    34         375        14.8         810           1  [illegible]  [illegible] homo sapien
    35 [illegible] [illegible]         816           1  [illegible]  [illegible] mus musculu
    36       371.5        14.7 [illegible] [illegible]  [illegible]  Q9qw30 rattus norv
    37         366        14.?         816           1  [illegible]  [illegible] homo sapien
    38         364        14.?         810 [illegible]  [illegible]  Q62919 rattus norv
    39       360.5        14.2         816           1  NEL_CHICK    Q90827 gallus gall
    40       359.5        14.2        1964           1  NTC4_MOUSE   P31695 mus musculu
    41         359        14.2         816           1  NEL2_RAT     Q62918 rattus norv
    42         356        14.1        2703           1  NOTC_DROME   P07207 drosophila
    43       355.5        14.0        2437           1  NTC1_BRARE   P46530 brachydanio
    44       350.5        13.8         652           1  CD93_HUMAN   Q9npy3 homo sapien
    45       347.5        13.7        2556           1  NTC1_HUMAN   P46531 homo sapien


ALIGNMENTS 



RESULT 1 
FBL5_HUMAN 

ID FBL5_HUMAN STANDARD; PRT; 448 AA. 

AC Q9UBX5; O75966;

DT 16-OCT-2001 (Rel. 40, Created)

DT 16-OCT-2001 (Rel. 40, Last sequence update)

DT 15-SEP-2003 (Rel. 42, Last annotation update)

DE Fibulin-5 precursor (FIBL-5) (Developmental arteries and neural crest

DE EGF-like protein) (Dance) (Urine p50 protein) (UP50).

GN FBLN5 OR DANCE.

OS Homo sapiens (Human).

OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; 

OC Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. 

OX NCBI_TaxID=9606; 

RN [1] 

RP SEQUENCE FROM N.A. 

RC TISSUE=Melanoma; 

RA Kostka G.;

RL Submitted (MAR-1999) to the EMBL/GenBank/DDBJ databases. 

RN [2] 

RP SEQUENCE FROM N.A. 

RX MEDLINE=99357779; PubMed=10428823;

RA Nakamura T., Ruiz-Lozano P., Lindner V., Yabe D., Taniwaki M.,

RA Furukawa Y., Kobuke K., Tashiro K., Lu Z., Andon N.L., Schaub R.,

RA Matsumori A., Sasayama S., Chien K.R., Honjo T.;

RT "DANCE, a novel secreted RGD protein expressed in developing,

RT atherosclerotic, and balloon-injured arteries.";

RL J. Biol. Chem. 274:22476-22483(1999).

RN [3] 

RP SEQUENCE FROM N.A. 

RC TISSUE=Urine; 

RA Zemel R., Sholto O., Shaul Y.;

RL Submitted (SEP-1998) to the EMBL/GenBank/DDBJ databases. 

CC -!- FUNCTION: PROMOTES ADHESION OF ENDOTHELIAL CELLS THROUGH 

CC INTERACTION OF INTEGRINS AND THE RGD MOTIF. COULD BE A VASCULAR 

CC LIGAND FOR INTEGRIN RECEPTORS AND MAY PLAY A ROLE IN VASCULAR 

CC DEVELOPMENT AND REMODELING. 

CC -!- SUBCELLULAR LOCATION: Secreted. 

CC -!- TISSUE SPECIFICITY: EXPRESSED PREDOMINANTLY IN HEART, OVARY, AND

CC COLON BUT ALSO IN KIDNEY, PANCREAS, TESTIS, LUNG, AND PLACENTA.

CC NOT DETECTABLE IN BRAIN, LIVER, THYMUS, PROSTATE, OR PERIPHERAL

CC BLOOD LEUKOCYTES.

CC -!- SIMILARITY: Belongs to the fibulin family.

CC -!- SIMILARITY: Contains 6 EGF-like domains. 

CC 

CC This SWISS-PROT entry is copyright. It is produced through a collaboration 

CC between the Swiss Institute of Bioinformatics and the EMBL outstation - 

CC the European Bioinformatics Institute. There are no restrictions on its 

CC use by non-profit institutions as long as its content is in no way 

CC modified and this statement is not removed. Usage by and for commercial 

CC entities requires a license agreement (See http://www.isb-sib.ch/announce/ 

CC or send an email to license@isb-sib.ch).

CC 

DR EMBL; AJ133490; CAB38568.1; -.

DR EMBL; AF112152; AAD41768.1; -.

DR EMBL; AF093118; AAC62107.1; -.

DR HSSP; P00736; 1APQ.

DR Genew; HGNC:3602; FBLN5.

DR MIM; 604580; -.

DR GO; GO:0005578; C:extracellular matrix; TAS.

DR GO; GO:0005625; C:soluble fraction; TAS.

DR GO; GO:0005178; F:integrin binding activity; TAS.

DR GO; GO:0007160; P:cell-matrix adhesion; TAS.

DR InterPro; IPR000152; Asx_hydroxyl.

DR InterPro; IPR001881; EGF_Ca.

DR InterPro; IPR006209; EGF_like.

DR Pfam; PF00008; EGF; 4.

DR SMART; SM00179; EGF_CA; 4.

DR PROSITE; PS00010; ASX_HYDROXYL; 4.

DR PROSITE; PS00022; EGF_1; FALSE_NEG.

DR PROSITE; PS01186; EGF_2; 4.

DR PROSITE; PS01187; EGF_CA; 6.

KW Cell adhesion; Calcium-binding; Repeat; Signal; EGF-like domain;

KW Glycoprotein.

FT SIGNAL 1 23 POTENTIAL.
FT CHAIN 24 448 FIBULIN-5.
FT DOMAIN 24 69 EGF-LIKE 1, DIVERGENT.
FT DOMAIN 127 167 EGF-LIKE 2, CALCIUM-BINDING (POTENTIAL).
FT DOMAIN 168 206 EGF-LIKE 3, CALCIUM-BINDING (POTENTIAL).
FT DOMAIN 207 246 EGF-LIKE 4, CALCIUM-BINDING (POTENTIAL).
FT DOMAIN 247 287 EGF-LIKE 5, CALCIUM-BINDING (POTENTIAL).
FT DOMAIN 288 333 EGF-LIKE 6, CALCIUM-BINDING (POTENTIAL).
FT SITE 54 56 CELL ATTACHMENT SITE (POTENTIAL).
FT DISULFID 131 144 BY SIMILARITY.
FT DISULFID 138 153 BY SIMILARITY.
FT DISULFID 155 166 BY SIMILARITY.
FT DISULFID 172 181 BY SIMILARITY.
FT DISULFID 177 190 BY SIMILARITY.
FT DISULFID 192 205 BY SIMILARITY.
FT DISULFID 211 221 BY SIMILARITY.
FT DISULFID 217 230 BY SIMILARITY.
FT DISULFID 232 245 BY SIMILARITY.
FT DISULFID 251 262 BY SIMILARITY.
FT DISULFID 258 271 BY SIMILARITY.
FT DISULFID 273 286 BY SIMILARITY.
FT DISULFID 292 305 BY SIMILARITY.
FT DISULFID 299 314 BY SIMILARITY.
FT DISULFID 320 332 BY SIMILARITY.
FT CARBOHYD 283 283 N-LINKED (GLCNAC...) (POTENTIAL).
FT CARBOHYD 296 296 N-LINKED (GLCNAC...) (POTENTIAL).
FT CONFLICT 69 70 IP -> HS (IN REF. 3).
FT CONFLICT 147 148 TE -> MK (IN REF. 3).
SQ SEQUENCE 448 AA; 50180 MW; 19FCA51FDA328003 CRC64;



Query Match 100.0%; Score 2533; DB 1; Length 448;
Best Local Similarity 100.0%; Pred. No. 1.4e-180;
Matches 448; Conservative 0; Mismatches 0; Indels 0; Gaps 0;

Qy   1 MPGIKRILTVTILALCLPSPGNAQAQCTNGFDLDRQSGQCLDIDECRTIPEACRGDMMCV 60
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db   1 MPGIKRILTVTILALCLPSPGNAQAQCTNGFDLDRQSGQCLDIDECRTIPEACRGDMMCV 60

Qy  61 NQNGGYLCIPRTNPVYRGPYSNPYSTPYSGPYPAAAPPLSAPNYPTISRPLICRFGYQMD 120
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  61 NQNGGYLCIPRTNPVYRGPYSNPYSTPYSGPYPAAAPPLSAPNYPTISRPLICRFGYQMD 120

Qy 121 ESNQCVDVDECATDSHQCNPTQICINTEGGYTCSCTDGYWLLEGQCLDIDECRYGYCQQL 180
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 121 ESNQCVDVDECATDSHQCNPTQICINTEGGYTCSCTDGYWLLEGQCLDIDECRYGYCQQL 180

Qy 181 CANVPGSYSCTCNPGFTLNEDGRSCQDVNECATENPCVQTCVNTYGSFICRCDPGYELEE 240
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 181 CANVPGSYSCTCNPGFTLNEDGRSCQDVNECATENPCVQTCVNTYGSFICRCDPGYELEE 240

Qy 241 DGVHCSDMDECSFSEFLCQHECVNQPGTYFCSCPPGYILLDDNRSCQDINECEHRNHTCN 300
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 241 DGVHCSDMDECSFSEFLCQHECVNQPGTYFCSCPPGYILLDDNRSCQDINECEHRNHTCN 300

Qy 301 LQQTCYNLQGGFKCIDPIRCEEPYLRISDNRCMCPAENPGCRDQPFTILYRDMDVVSGRS 360
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 301 LQQTCYNLQGGFKCIDPIRCEEPYLRISDNRCMCPAENPGCRDQPFTILYRDMDVVSGRS 360

Qy 361 VPADIFQMQATTRYPGAYYIFQIKSGNEGREFYMRQTGPISATLVMTRPIKGPREIQLDL 420
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 361 VPADIFQMQATTRYPGAYYIFQIKSGNEGREFYMRQTGPISATLVMTRPIKGPREIQLDL 420

Qy 421 EMITVNTVINFRGSSVIRLRIYVSQYPF 448
       ||||||||||||||||||||||||||||
Db 421 EMITVNTVINFRGSSVIRLRIYVSQYPF 448



RESULT 2 
FBL5_RAT

ID FBL5_RAT STANDARD; PRT; 448 AA.

AC Q9WVH8; Q9R284;

DT 16-OCT-2001 (Rel. 40, Created)

DT 16-OCT-2001 (Rel. 40, Last sequence update)

DT 15-SEP-2003 (Rel. 42, Last annotation update)

DE Fibulin-5 precursor (FIBL-5) (Developmental arteries and neural crest

DE EGF-like protein) (Dance) (Embryonic vascular EGF repeat-containing

DE protein) (EVEC).

GN FBLN5 OR DANCE.

OS Rattus norvegicus (Rat).

OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;

OC Mammalia; Eutheria; Rodentia; Sciurognathi; Muridae; Murinae; Rattus.

OX NCBI_TaxID=10116;

RN [1] 

RP SEQUENCE FROM N.A. 

RX MEDLINE=99357779; PubMed=10428823;

RA Nakamura T., Ruiz-Lozano P., Lindner V., Yabe D., Taniwaki M.,

RA Furukawa Y., Kobuke K., Tashiro K., Lu Z., Andon N.L., Schaub R.,

RA Matsumori A., Sasayama S., Chien K.R., Honjo T.;

RT "DANCE, a novel secreted RGD protein expressed in developing,

RT atherosclerotic, and balloon-injured arteries.";

RL J. Biol. Chem. 274:22476-22483(1999).

RN [2]

RP SEQUENCE FROM N.A.

RX MEDLINE=99278197; PubMed=10347091;

RA Kowal R.C., Richardson J.A., Miano J.M., Olson E.N.;

RT "EVEC, a novel epidermal growth factor-like repeat-containing protein

RT upregulated in embryonic and diseased adult vasculature.";

RL Circ. Res. 84:1166-1176(1999).

CC -!- FUNCTION: PROMOTES ADHESION OF ENDOTHELIAL CELLS THROUGH

CC INTERACTION OF INTEGRINS AND THE RGD MOTIF. COULD BE A VASCULAR

CC LIGAND FOR INTEGRIN RECEPTORS AND MAY PLAY A ROLE IN VASCULAR

CC DEVELOPMENT AND REMODELING.

CC -!- SUBCELLULAR LOCATION: Secreted.

CC -!- SIMILARITY: Belongs to the fibulin family.

CC -!- SIMILARITY: Contains 6 EGF-like domains.

CC

CC This SWISS-PROT entry is copyright. It is produced through a collaboration

CC between the Swiss Institute of Bioinformatics and the EMBL outstation -

CC the European Bioinformatics Institute. There are no restrictions on its 

CC use by non-profit institutions as long as its content is in no way 

CC modified and this statement is not removed. Usage by and for commercial 

CC entities requires a license agreement (See http://www.isb-sib.ch/announce/ 

CC or send an email to license@isb-sib.ch).

CC 

DR EMBL; AF112153; AAD41769.1; -.

DR EMBL; AF137350; AAD25101.1; -.

DR HSSP; P00736; 1APQ.

DR InterPro; IPR000152; Asx_hydroxyl.

DR InterPro; IPR001881; EGF_Ca.

DR InterPro; IPR006209; EGF_like.

DR Pfam; PF00008; EGF; 4.

DR SMART; SM00179; EGF_CA; 4.

DR PROSITE; PS00010; ASX_HYDROXYL; 4.

DR PROSITE; PS00022; EGF_1; FALSE_NEG.

DR PROSITE; PS01186; EGF_2; 4.

DR PROSITE; PS01187; EGF_CA; 6.

KW Cell adhesion; Calcium-binding; Repeat; Signal; EGF-like domain;

KW Glycoprotein.



FT SIGNAL 1 23 POTENTIAL.
FT CHAIN 24 448 FIBULIN-5.
FT DOMAIN 24 69 EGF-LIKE 1, DIVERGENT.
FT DOMAIN 127 167 EGF-LIKE 2, CALCIUM-BINDING (POTENTIAL).
FT DOMAIN 168 206 EGF-LIKE 3, CALCIUM-BINDING (POTENTIAL).
FT DOMAIN 207 246 EGF-LIKE 4, CALCIUM-BINDING (POTENTIAL).
FT DOMAIN 247 287 EGF-LIKE 5, CALCIUM-BINDING (POTENTIAL).
FT DOMAIN 288 333 EGF-LIKE 6, CALCIUM-BINDING (POTENTIAL).
FT SITE 54 56 CELL ATTACHMENT SITE (POTENTIAL).
FT DISULFID 131 144 BY SIMILARITY.
FT DISULFID 138 153 BY SIMILARITY.
FT DISULFID 155 166 BY SIMILARITY.
FT DISULFID 172 181 BY SIMILARITY.
FT DISULFID 177 190 BY SIMILARITY.
FT DISULFID 192 205 BY SIMILARITY.
FT DISULFID 211 221 BY SIMILARITY.
FT DISULFID 217 230 BY SIMILARITY.
FT DISULFID 232 245 BY SIMILARITY.
FT DISULFID 251 262 BY SIMILARITY.
FT DISULFID 258 271 BY SIMILARITY.
FT DISULFID 273 286 BY SIMILARITY.
FT DISULFID 292 305 BY SIMILARITY.
FT DISULFID 299 314 BY SIMILARITY.
FT DISULFID 320 332 BY SIMILARITY.
FT CARBOHYD 283 283 N-LINKED (GLCNAC...) (POTENTIAL).
FT CARBOHYD 296 296 N-LINKED (GLCNAC...) (POTENTIAL).
FT CONFLICT 238 238 L -> P (IN REF. 2).
SQ SEQUENCE 448 AA; 50160 MW; E6BC68F7BF14B714 CRC64;



Query Match 95.0%; Score 2406; DB 1; Length 448;
Best Local Similarity 94.2%; Pred. No. 3.5e-171;
Matches 422; Conservative 11; Mismatches 15; Indels 0; Gaps 0;

Qy   1 MPGIKRILTVTILALCLPSPGNAQAQCTNGFDLDRQSGQCLDIDECRTIPEACRGDMMCV 60
       |||:||||||||||| || ||||| |||||||||||:|||||||||||||||||||||||
Db   1 MPGLKRILTVTILALWLPHPGNAQQQCTNGFDLDRQTGQCLDIDECRTIPEACRGDMMCV 60

Qy  61 NQNGGYLCIPRTNPVYRGPYSNPYSTPYSGPYPAAAPPLSAPNYPTISRPLICRFGYQMD 120
       |||||||||||||||||||||||||| |||||||||||: | |||||||||:||||||||
Db  61 NQNGGYLCIPRTNPVYRGPYSNPYSTSYSGPYPAAAPPVPASNYPTISRPLVCRFGYQMD 120

Qy 121 ESNQCVDVDECATDSHQCNPTQICINTEGGYTCSCTDGYWLLEGQCLDIDECRYGYCQQL 180
       | ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 121 EGNQCVDVDECATDSHQCNPTQICINTEGGYTCSCTDGYWLLEGQCLDIDECRYGYCQQL 180

Qy 181 CANVPGSYSCTCNPGFTLNEDGRSCQDVNECATENPCVQTCVNTYGSFICRCDPGYELEE 240
       |||||||||||||||||||:||||||||||| ||||||||||||||||||||||||||||
Db 181 CANVPGSYSCTCNPGFTLNDDGRSCQDVNECETENPCVQTCVNTYGSFICRCDPGYELEE 240

Qy 241 DGVHCSDMDECSFSEFLCQHECVNQPGTYFCSCPPGYILLDDNRSCQDINECEHRNHTCN 300
       ||:||||||||||||||||||||||||:|||||||||:||:||||||||||||||||||
Db 241 DGIHCSDMDECSFSEFLCQHECVNQPGSYFCSCPPGYVLLEDNRSCQDINECEHRNHTCT 300

Qy 301 LQQTCYNLQGGFKCIDPIRCEEPYLRISDNRCMCPAENPGCRDQPFTILYRDMDVVSGRS 360
         |||||||||||||||| |||||| | |||||||||| ||||||||||:||||||||||
Db 301 PLQTCYNLQGGFKCIDPIVCEEPYLLIGDNRCMCPAENTGCRDQPFTILFRDMDVVSGRS 360

Qy 361 VPADIFQMQATTRYPGAYYIFQIKSGNEGREFYMRQTGPISATLVMTRPIKGPREIQLDL 420
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||:|||||
Db 361 VPADIFQMQATTRYPGAYYIFQIKSGNEGREFYMRQTGPISATLVMTRPIKGPRDIQLDL 420

Qy 421 EMITVNTVINFRGSSVIRLRIYVSQYPF 448
       ||||||||||||||||||||||||||||
Db 421 EMITVNTVINFRGSSVIRLRIYVSQYPF 448

RESULT 3 
FBL5_MOUSE

ID FBL5_MOUSE STANDARD; PRT; 448 AA.

AC Q9WVH9;

DT 16-OCT-2001 (Rel. 40, Created)

DT 16-OCT-2001 (Rel. 40, Last sequence update)

DT 15-SEP-2003 (Rel. 42, Last annotation update)

DE Fibulin-5 precursor (FIBL-5) (Developmental arteries and neural crest

DE EGF-like protein) (Dance).

GN FBLN5 OR DANCE.

OS Mus musculus (Mouse).

OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;

OC Mammalia; Eutheria; Rodentia; Sciurognathi; Muridae; Murinae; Mus.

OX NCBI_TaxID=10090;

RN [1]

RP SEQUENCE FROM N.A.

RX MEDLINE=99357779; PubMed=10428823;

RA Nakamura T., Ruiz-Lozano P., Lindner V., Yabe D., Taniwaki M.,

RA Furukawa Y., Kobuke K., Tashiro K., Lu Z., Andon N.L., Schaub R.,

RA Matsumori A., Sasayama S., Chien K.R., Honjo T.;

RT "DANCE, a novel secreted RGD protein expressed in developing,

RT atherosclerotic, and balloon-injured arteries.";

RL J. Biol. Chem. 274:22476-22483(1999).

CC -!- FUNCTION: PROMOTES ADHESION OF ENDOTHELIAL CELLS THROUGH

CC INTERACTION OF INTEGRINS AND THE RGD MOTIF. COULD BE A VASCULAR

CC LIGAND FOR INTEGRIN RECEPTORS AND MAY PLAY A ROLE IN VASCULAR

CC DEVELOPMENT AND REMODELING.

CC -!- SUBCELLULAR LOCATION: Secreted.

CC -!- SIMILARITY: Belongs to the fibulin family.

CC -!- SIMILARITY: Contains 6 EGF-like domains.

CC 

CC This SWISS-PROT entry is copyright. It is produced through a collaboration 

CC between the Swiss Institute of Bioinformatics and the EMBL outstation - 

CC the European Bioinformatics Institute. There are no restrictions on its 

CC use by non-profit institutions as long as its content is in no way 

CC modified and this statement is not removed. Usage by and for commercial 

CC entities requires a license agreement (See http://www.isb-sib.ch/announce/ 

CC or send an email to license@isb-sib.ch).

CC

DR EMBL; AF112151; AAD41767.1; -.

DR HSSP; P00736; 1APQ.



DR MGD; MGI:1346091; Fbln5.
DR InterPro; IPR000152; Asx_hydroxyl.
DR InterPro; IPR001881; EGF_Ca.
DR InterPro; IPR006209; EGF_like.
DR Pfam; PF00008; EGF; 4.
DR SMART; SM00179; EGF_CA; 4.
DR PROSITE; PS00010; ASX_HYDROXYL; 4.
DR PROSITE; PS00022; EGF_1; FALSE_NEG.
DR PROSITE; PS01186; EGF_2; 4.
DR PROSITE; PS01187; EGF_CA; 6.
KW Cell adhesion; Calcium-binding; Repeat; Signal; EGF-like domain;
KW Glycoprotein.
FT SIGNAL 1 23 POTENTIAL.
FT CHAIN 24 448 FIBULIN-5.
FT DOMAIN 24 69 EGF-LIKE 1, DIVERGENT.
FT DOMAIN 127 167 EGF-LIKE 2, CALCIUM-BINDING (POTENTIAL).
FT DOMAIN 168 206 EGF-LIKE 3, CALCIUM-BINDING (POTENTIAL).
FT DOMAIN 207 246 EGF-LIKE 4, CALCIUM-BINDING (POTENTIAL).
FT DOMAIN 247 287 EGF-LIKE 5, CALCIUM-BINDING (POTENTIAL).
FT DOMAIN 288 333 EGF-LIKE 6, CALCIUM-BINDING (POTENTIAL).
FT SITE 54 56 CELL ATTACHMENT SITE (POTENTIAL).
FT DISULFID 131 144 BY SIMILARITY.
FT DISULFID 138 153 BY SIMILARITY.
FT DISULFID 155 166 BY SIMILARITY.
FT DISULFID 172 181 BY SIMILARITY.
FT DISULFID 177 190 BY SIMILARITY.
FT DISULFID 192 205 BY SIMILARITY.
FT DISULFID 211 221 BY SIMILARITY.
FT DISULFID 217 230 BY SIMILARITY.
FT DISULFID 232 245 BY SIMILARITY.
FT DISULFID 251 262 BY SIMILARITY.
FT DISULFID 258 271 BY SIMILARITY.
FT DISULFID 273 286 BY SIMILARITY.
FT DISULFID 292 305 BY SIMILARITY.
FT DISULFID 299 314 BY SIMILARITY.
FT DISULFID 320 332 BY SIMILARITY.
FT CARBOHYD 283 283 N-LINKED (GLCNAC...) (POTENTIAL).
FT CARBOHYD 296 296 N-LINKED (GLCNAC...) (POTENTIAL).
SQ SEQUENCE 448 AA; 50193 MW; F15CC70CCFBFDC97 CRC64;



Query Match 94.9%; Score 2405; DB 1; Length 448;
Best Local Similarity 94.2%; Pred. No. 4.1e-171;
Matches 422; Conservative 10; Mismatches 16; Indels 0; Gaps 0;

Qy   1 MPGIKRILTVTILALCLPSPGNAQAQCTNGFDLDRQSGQCLDIDECRTIPEACRGDMMCV 60
       |||:||||||||||| || ||||| |||||||||||||||||||||||||||||||||||
Db   1 MPGLKRILTVTILALWLPHPGNAQQQCTNGFDLDRQSGQCLDIDECRTIPEACRGDMMCV 60

Qy  61 NQNGGYLCIPRTNPVYRGPYSNPYSTPYSGPYPAAAPPLSAPNYPTISRPLICRFGYQMD 120
       |||||||||||||||||||||||||| |||||||||||: | |||||||||:||||||||
Db  61 NQNGGYLCIPRTNPVYRGPYSNPYSTSYSGPYPAAAPPVPASNYPTISRPLVCRFGYQMD 120

Qy 121 ESNQCVDVDECATDSHQCNPTQICINTEGGYTCSCTDGYWLLEGQCLDIDECRYGYCQQL 180
       | ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 121 EGNQCVDVDECATDSHQCNPTQICINTEGGYTCSCTDGYWLLEGQCLDIDECRYGYCQQL 180

Qy 181 CANVPGSYSCTCNPGFTLNEDGRSCQDVNECATENPCVQTCVNTYGSFICRCDPGYELEE 240
       |||||||||||||||||||:||||||||||| ||||||||||||||||||||||||||||
Db 181 CANVPGSYSCTCNPGFTLNDDGRSCQDVNECETENPCVQTCVNTYGSFICRCDPGYELEE 240

Qy 241 DGVHCSDMDECSFSEFLCQHECVNQPGTYFCSCPPGYILLDDNRSCQDINECEHRNHTCN 300
       ||:||||||||||||||||||||||||:|||||||||:|||||||||||||||||||||
Db 241 DGIHCSDMDECSFSEFLCQHECVNQPGSYFCSCPPGYVLLDDNRSCQDINECEHRNHTCT 300

Qy 301 LQQTCYNLQGGFKCIDPIRCEEPYLRISDNRCMCPAENPGCRDQPFTILYRDMDVVSGRS 360
         |||||||||||||||| |||||| | :||||||||:  ||||||||||||||||||||
Db 301 SLQTCYNLQGGFKCIDPISCEEPYLLIGENRCMCPAEHTSCRDQPFTILYRDMDVVSGRS 360

Qy 361 VPADIFQMQATTRYPGAYYIFQIKSGNEGREFYMRQTGPISATLVMTRPIKGPREIQLDL 420
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||:|||||
Db 361 VPADIFQMQATTRYPGAYYIFQIKSGNEGREFYMRQTGPISATLVMTRPIKGPRDIQLDL 420

Qy 421 EMITVNTVINFRGSSVIRLRIYVSQYPF 448
       ||||||||||||||||||||||||||||
Db 421 EMITVNTVINFRGSSVIRLRIYVSQYPF 448

RESULT 4 
FBL4_CRIGR 

ID FBL4_CRIGR STANDARD; PRT; 443 AA. 

AC O55058;

DT 16-OCT-2001 (Rel. 40, Created)

DT 16-OCT-2001 (Rel. 40, Last sequence update)

DT 15-SEP-2003 (Rel. 42, Last annotation update)

DE EGF-containing fibulin-like extracellular matrix protein 2 precursor

DE (Fibulin-4) (FIBL-4) (H411 protein).

GN EFEMP2 OR FBLN4.

OS Cricetulus griseus (Chinese hamster).

OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;

OC Mammalia; Eutheria; Rodentia; Sciurognathi; Muridae; Cricetinae;

OC Cricetulus.

OX NCBI_TaxID=10029;

RN [1] 

RP SEQUENCE FROM N.A. 

RC TISSUE=Ovary; 

RA Heine H., Delude R.L., Monks B., Golenbock D.T.;

RL Submitted (FEB-1998) to the EMBL/GenBank/DDBJ databases.

CC -!- SUBCELLULAR LOCATION: Secreted.

CC -!- SIMILARITY: Belongs to the fibulin family.

CC -!- SIMILARITY: Contains 6 EGF-like domains. 

CC 

CC This SWISS-PROT entry is copyright. It is produced through a collaboration 

CC between the Swiss Institute of Bioinformatics and the EMBL outstation -

CC the European Bioinformatics Institute. There are no restrictions on its 

CC use by non-profit institutions as long as its content is in no way 

CC modified and this statement is not removed. Usage by and for commercial 

CC entities requires a license agreement (See http://www.isb-sib.ch/announce/ 

CC or send an email to license@isb-sib.ch).

CC 

DR EMBL; AF046870; AAC03101.1; -. 

DR HSSP; P00736; 1APQ.
DR InterPro; IPR000152; Asx_hydroxyl.
DR InterPro; IPR001881; EGF_Ca.
DR InterPro; IPR006209; EGF_like.
DR InterPro; IPR001491; Thrmbomoduln.
DR Pfam; PF00008; EGF; 4.
DR PRINTS; PR00907; THRMBOMODULN.
DR SMART; SM00179; EGF_CA; 4.
DR PROSITE; PS00010; ASX_HYDROXYL; 4.
DR PROSITE; PS00022; EGF_1; FALSE_NEG.
DR PROSITE; PS01186; EGF_2; 4.
DR PROSITE; PS01187; EGF_CA; 6.
KW Repeat; EGF-like domain; Calcium-binding; Glycoprotein; Signal.
FT SIGNAL 1 25 POTENTIAL.
FT CHAIN 26 443 EGF-CONTAINING FIBULIN-LIKE EXTRACELLULAR
FT MATRIX PROTEIN 2.
FT DOMAIN 36 81 EGF-LIKE 1, DIVERGENT.
FT DOMAIN 123 163 EGF-LIKE 2, CALCIUM-BINDING (POTENTIAL).
FT DOMAIN 164 202 EGF-LIKE 3, CALCIUM-BINDING (POTENTIAL).
FT DOMAIN 203 242 EGF-LIKE 4, CALCIUM-BINDING (POTENTIAL).
FT DOMAIN 243 282 EGF-LIKE 5, CALCIUM-BINDING (POTENTIAL).
FT DOMAIN 283 328 EGF-LIKE 6, CALCIUM-BINDING (POTENTIAL).
FT DISULFID 127 140 BY SIMILARITY.
FT DISULFID 134 149 BY SIMILARITY.
FT DISULFID 151 162 BY SIMILARITY.
FT DISULFID 168 177 BY SIMILARITY.
FT DISULFID 173 186 BY SIMILARITY.
FT DISULFID 188 201 BY SIMILARITY.
FT DISULFID 207 217 BY SIMILARITY.
FT DISULFID 213 226 BY SIMILARITY.
FT DISULFID 228 241 BY SIMILARITY.
FT DISULFID 247 258 BY SIMILARITY.
FT DISULFID 254 267 BY SIMILARITY.
FT DISULFID 269 281 BY SIMILARITY.
FT DISULFID 287 300 BY SIMILARITY.
FT DISULFID 294 309 BY SIMILARITY.
FT DISULFID 315 327 BY SIMILARITY.
FT CARBOHYD 198 198 N-LINKED (GLCNAC...) (POTENTIAL).
FT CARBOHYD 394 394 N-LINKED (GLCNAC...) (POTENTIAL).
SQ SEQUENCE 443 AA; 49432 MW; 0BCFE5D7323D9E5F CRC64;

Query Match 50.9%; Score 1290; DB 1; Length 443; 

Best Local Similarity 50.1%; Pred. No. 1.5e-88; 

Matches 227; Conservative 71; Mismatches 133; Indels 22; Gaps 5; 

Qy 1 M PG I KR I LTVTI LALCL PS PGNAQ AQCTNGFDLDRQSGQCLDIDECRTI PEACRG 55 

= 1! : ::|| II =!!=! = = I I I h=ll IM hi 

Db 8 LPGSLLLWALLLLLLGAASPQDSEEPDSYTECTDGYEWDADSQHCRDVNECLTIPEACKG 67 

Qy 56 DMMCVNQNGGYLCIPRTNPVYRGPYSNPYSTPYSGPYPAAAPPLSAPNYPTISRPLICRF 115

i hi llllhlh ; II I II || I 
Db 68 EMKCINHYGGYLCLPRSAAVINDLHG EGP-PPPVPPAQHPN PCPP 111 

Qy 116 GYQMDESNQCVDVDECATDSHQCNPTQICINTEGGYTCSCTDGYWLLEGQCLDIDECRYG 175 

lh II llllllll I I hi I I I I hi III : -hill i. 
Db 112 GYEPDEQESCVDVDECAQALHDCRPSQDCHNLPGSYQCTCPDGYRKVGPECVDIDECRYR 171

Qy 176 YCQQLCANVPGSYSCTCNPGFTLNEDGRSCQDVNECATENPCVQTCVNTYGSFICRCDPG 235 

Ml I hllh I I III I : III Mill II I I hlhhllh I 

Db 172 YCQHRCVNLPGSFRCQCEPGFQLGPNNRSCVDVNECDMGAPCEQRCFNSYGTFLCRCNQG 231



Qy 236 YELEEDGVHCSDMDECSFSEFLCQHECVNQPGTYFCSCPPGYILLDDNRSCQDINECEHR 295

III II llhlllhl =111= llhll = I II II II I lllhlll

Db 232 YELHRDGFSCSDIDECSYSSYLCQYRCVNEPGRFSCHCPQGYQLL-ATRLCQDIDECETG 290

Qy 296 NHTCNLQQTCYNLQGGFKCIDPIRCEEPYLRISDNRCMCPAENPGCRDQPFTILYRDMDV 355

I I III I I! E!|:::|llli II II I I I I "- I I =

Db 291 AHQCSEAQTCVNFHGGYRCVDTNRCVEPYVQVSDNRCFCPVSNPLCREQPSSIVHRYMSI 350

Qy 356 VSGRSVPADIFQMQATTRYPGAYYIFQIKSGNEGREFYMRQTGPISATLVMTRPIKGPRE 415

I lllllhlhllh Mill II: ! =11 = 11 =11 11= 11= 1 1 1 1

Db 351 TSERSVPADVFQIQATSVYPGAYNAFQIRAGNTQGDFYIRQINNVSAMLVLARPVTGPRE 410

Qy 416 IQLDLEMITVNTVINFRGSSVIRLRIYVSQYPF 448

l|[|hhh:- = l llhll -I I I

Db 411 [sequence illegible]


RESULT 5 
FBL4_HUMAN 

ID FBL4_HUMAN STANDARD; PRT; 443 AA. 

AC O95967; O75967;

DT 16-OCT-2001 (Rel. 40, Created) 

DT 16-OCT-2001 (Rel. 40, Last sequence update) 

DT 15-SEP-2003 (Rel. 42, Last annotation update) 

DE EGF-containing fibulin-like extracellular matrix protein 2 precursor 

DE (Fibulin-4) (FIBL-4) (UPH1 protein).

GN EFEMP2 OR FBLN4.

OS Homo sapiens (Human).

OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; 

OC Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. 

OX NCBI_TaxID=9606; 

RN [1] 

RP SEQUENCE FROM N.A. 

RC TISSUE=Melanoma; 

RX MEDLINE=20068041; PubMed=10601734;

RA Giltay R., Timpl R., Kostka G.;

RT "Sequence, recombinant expression and tissue localization of two novel 

RT extracellular matrix proteins, fibulin-3 and fibulin-4."; 

RL Matrix Biol. 18:469-480(1999). 

RN [2] 

RP SEQUENCE FROM N.A. 

RA Zemel R., Shaul Y.;

RL Submitted (SEP-1998) to the EMBL/GenBank/DDBJ databases.

RN [3]

RP SEQUENCE FROM N.A.

RX MEDLINE=20435063; PubMed=10982184;

RA Katsanis N., Venable S., Smith J.R., Lupski J.R.;

RT "Isolation of a paralog of the Doyne honeycomb retinal dystrophy gene

RT from the multiple retinopathy critical region on 11q13.";

RL Hum. Genet. 106:66-72(2000). 
RN [4] 

RP SEQUENCE FROM N.A. 

RC TISSUE=Brain; 

RX MEDLINE=22388257; PubMed=12477932;

RA Strausberg R.L., Feingold E.A., Grouse L.H., Derge J.G.,

RA Klausner R.D., Collins F.S., Wagner L., Shenmen C.M., Schuler G.D.,

RA Altschul S.F., Zeeberg B., Buetow K.H., Schaefer C.F., Bhat N.K.,

RA Hopkins R.F., Jordan H., Moore T., Max S.I., Wang J., Hsieh F.,

RA Diatchenko L., Marusina K., Farmer A.A., Rubin G.M., Hong L.,

RA Stapleton M., Soares M.B., Bonaldo M.F., Casavant T.L., Scheetz T.E.,

RA Brownstein M.J., Usdin T.B., Toshiyuki S., Carninci P., Prange C.,

RA Raha S.S., Loquellano N.A., Peters G.J., Abramson R.D., Mullahy S.J.,

RA Bosak S.A., McEwan P.J., McKernan K.J., Malek J.A., Gunaratne P.H.,

RA Richards S., Worley K.C., Hale S., Garcia A.M., Gay L.J., Hulyk S.W.,

RA Villalon D.K., Muzny D.M., Sodergren E.J., Lu X., Gibbs R.A.,

RA Fahey J., Helton E., Ketteman M., Madan A., Rodrigues S., Sanchez A.,

RA Whiting M., Madan A., Young A.C., Shevchenko Y., Bouffard G.G.,

RA Blakesley R.W., Touchman J.W., Green E.D., Dickson M.C.,

RA Rodriguez A.C., Grimwood J., Schmutz J., Myers R.M.,

RA Butterfield Y.S.N., Krzywinski M.I., Skalska U., Smailus D.E.,

RA Schnerch A., Schein J.E., Jones S.J.M., Marra M.A.;

RT "Generation and initial analysis of more than 15,000 full-length

RT human and mouse cDNA sequences.";

RL Proc. Natl. Acad. Sci. U.S.A. 99:16899-16903(2002). 

CC -!- SUBCELLULAR LOCATION: Secreted.

CC -!- SIMILARITY: Belongs to the fibulin family. 

CC -!- SIMILARITY: Contains 6 EGF-like domains. 

CC 

CC This SWISS-PROT entry is copyright. It is produced through a collaboration 

CC between the Swiss Institute of Bioinformatics and the EMBL outstation - 

CC the European Bioinformatics Institute. There are no restrictions on its

CC use by non-profit institutions as long as its content is in no way 

CC modified and this statement is not removed. Usage by and for commercial 

CC entities requires a license agreement (See http://www.isb-sib.ch/announce/ 

CC or send an email to license@isb-sib.ch).

CC 

DR EMBL; AJ132819; CAA10791.2; -. 

DR EMBL; AF093119; AAC62108.1; -. 

DR EMBL; AF109121; AAF65188.1; -.

DR EMBL; BC010456; AAH10456.1; -.

DR HSSP; P35555; 1EMN.

DR Genew; HGNC:3219; EFEMP2.

DR MIM; 604633; -.

DR GO; GO:0005604; C:basement membrane; TAS.

DR GO; GO:0005201; F:extracellular matrix structural constituent; TAS.

DR InterPro; IPR000152; Asx_hydroxyl.

DR InterPro; IPR001881; EGF_Ca.

DR InterPro; IPR006209; EGF_like.

DR InterPro; IPR001491; Thrmbomoduln.

DR Pfam; PF00008; EGF; 4.

DR PRINTS; PR00907; THRMBOMODULN.

DR SMART; SM00179; EGF_CA; 4.

DR PROSITE; PS00010; ASX_HYDROXYL; 4.

DR PROSITE; PS00022; EGF_1; FALSE_NEG.

DR PROSITE; PS01186; EGF_2; 4.

DR PROSITE; PS01187; EGF_CA; 6.

KW Repeat; EGF-like domain; Calcium-binding; Glycoprotein; Signal. 

FT SIGNAL 1 25 POTENTIAL.
FT CHAIN 26 443 EGF-CONTAINING FIBULIN-LIKE EXTRACELLULAR
FT MATRIX PROTEIN 2.
FT DOMAIN 36 81 EGF-LIKE 1, DIVERGENT.
FT DOMAIN 123 163 EGF-LIKE 2, CALCIUM-BINDING (POTENTIAL).
FT DOMAIN 164 202 EGF-LIKE 3, CALCIUM-BINDING (POTENTIAL).
FT DOMAIN 203 242 EGF-LIKE 4, CALCIUM-BINDING (POTENTIAL).
FT DOMAIN 243 282 EGF-LIKE 5, CALCIUM-BINDING (POTENTIAL).
FT DOMAIN 283 328 EGF-LIKE 6, CALCIUM-BINDING (POTENTIAL).
FT DISULFID 127 140 BY SIMILARITY.
FT DISULFID 134 149 BY SIMILARITY.
FT DISULFID 151 162 BY SIMILARITY.
FT DISULFID 168 177 BY SIMILARITY.
FT DISULFID 173 186 BY SIMILARITY.
FT DISULFID 188 201 BY SIMILARITY.
FT DISULFID 207 217 BY SIMILARITY.
FT DISULFID 213 226 BY SIMILARITY.
FT DISULFID 228 241 BY SIMILARITY.
FT DISULFID 247 258 BY SIMILARITY.
FT DISULFID 254 267 BY SIMILARITY.
FT DISULFID 269 281 BY SIMILARITY.
FT DISULFID 287 300 BY SIMILARITY.
FT DISULFID 294 309 BY SIMILARITY.
FT DISULFID 315 327 BY SIMILARITY.
FT CARBOHYD 198 198 N-LINKED (GLCNAC...) (POTENTIAL).
FT CARBOHYD 394 394 N-LINKED (GLCNAC...) (POTENTIAL).
FT CONFLICT 5 5 A -> T (IN REF. 1).
FT CONFLICT 44 51 EWDPDSQH -> TQTAN (IN REF. 2).
FT CONFLICT 103 111 AQHPNPCPP -> VNTQPLPT (IN REF. 2).
FT CONFLICT 294 294 C -> W (IN REF. 2).
FT CONFLICT 354 356 RSV -> AER (IN REF. 2).
FT CONFLICT 355 355 S -> R (IN REF. 3).
SQ SEQUENCE 443 AA; 49391 MW; 9E9AC2393780D3B8 CRC64;



Query Match 50.9%; Score 1289; DB 1; Length 443; 

Best Local Similarity 49.9%; Pred. No. 1.7e-88; 

Matches 226; Conservative 74; Mismatches 131; Indels 22; Gaps 5; 

Qy 1 MPGI KRI LTVTI LALCLPSPGNAQ AQCTNGFDLDRQSGQCLDI DECRTI PEACRG 55 

= 11 = = :| I ii - = =11 = 1 = = I I I l = = ll I I E I I I = f 

Db 8 LPGSLLLWALLLLLLGSASPQDSEEPDSYTECTDGYEWDPDSQHCRDVNECLTIPEACKG 67

Qy 56 DMMCVNQNGGYLCIPRTNPVYRGPYSNPYSTPYSGPYPAAAPPLSAPNYPTISRPLICRF 115

= 1 M Mi- ' | : II I II II I 
Db 68 EMKCI NHYGGYLCLPRSAAVI NDLHG EGP-PPPVPPAQHPN PCPP 111 

Qy 116 GYQMDESNQCVDVDECATDSHQCNPTQICINTEGGYTCSCTDGYWLLEGQCLDIDECRYG 175

11= != = MINIM I I 1=1 I I I I hi Ml = = I = E i 1 1 1 1 1 

Db 112 GYEPDDQDSCVDVDECAQALHDCRPSQDCHNLPGSYQCTCPDGYRKIGPECVDIDECRYR 171 

Qy 176 YCQQLCANVPGSYSCTCNPGFTLNEDGRSCQDVNECATENPCVQTCVNTYGSFICRCDPG 235

III I 1=111= I I III I : III Mill II I I hlhhlll I 

Db 172 YCQHRCVNLPGSFRCQCEPGFQLGPNNRSCVDVNECDMGAPCEQRCFNSYGTFLCRCHQG 231 

Qy 236 YELEEDGVHCSDMDECSFSEFLCQHECVNQPGTYFCSCPPGYILLDDNRSCQDINECEHR 295

Ml II 111 = 1111 = 1 =111= Mhll = I II II II I I I I I I 
Db 232 YELHRDGFSCSDIDECSYSSYLCQYRCVNEPGRFSCHCPQGYQLL-ATRLCQDIDECESG 290 

Qy 296 NHTCNLQQTCYNLQGGFKCIDPIRCEEPYLRISDNRCMCPAENPGCRDQPFTILYRDMDV 355

I h III I ll = = E = l II IIE = = = [ = IIMEII El 11 = 11 =l = = l I = 

Db 291 AHQCSEAQTCVNFHGGYRCVDTNRCVEPYIQVSENRCLCPASNPLCREQPSSIVHRYMTI 350 

Qy 356 VSGRSVPADIFQMQATTRYPGAYYIFQIKSGNEGREFYMRQTGPISATLVMTRPIKGPRE 415

I III ! Ih I lllll II : II I HI II ||: ||: |||| 



Db 351 TSERSVPADVFQIQATSVYPGAYNAFQIRAGNSQGDFYIRQINNVSAMLVLARPVTGPRE 410



Qy 416 IQLDLEMITVNTVINFRGSSVIRLRIYVSQYPF 448

llllhhh::-! 111 = 13 -I I I 
Db 411 YVLDLEMVTMNSLMSYRASSVLRLTVFVGAYTF 443 



RESULT 6 
FBL4_MOUSE 

ID FBL4_MOUSE STANDARD; PRT; 443 AA. 

AC Q9WVJ9;

DT 16-OCT-2001 (Rel. 40, Created)

DT 16-OCT-2001 (Rel. 40, Last sequence update)

DT 15-SEP-2003 (Rel. 42, Last annotation update)

DE EGF-containing fibulin-like extracellular matrix protein 2 precursor

DE (Fibulin-4) (FIBL-4) (Mutant p53 binding protein 1).

GN EFEMP2 OR FBLN4 OR MBP1.

OS Mus musculus (Mouse).

OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;

OC Mammalia; Eutheria; Rodentia; Sciurognathi; Muridae; Murinae; Mus.

OX NCBI_TaxID=10090; 

RN [1] 

RP SEQUENCE FROM N.A. 

RC STRAIN=C57BL/6J; 

RX MEDLINE=99308589; PubMed=10380882;

RA Gallagher W.M., Argentini M., Sierra V., Bracco L., Debussche L.,

RA Conseiller E.;

RT "MBP1: a novel mutant p53-specific protein partner with oncogenic

RT properties.";

RL Oncogene 18:3608-3616(1999).

CC -!- SUBUNIT: BINDS PREFERENTIALLY TO P53 MUTANTS. 

CC -!- SUBCELLULAR LOCATION: Secreted. 

CC -!- SIMILARITY: Belongs to the fibulin family.

CC -!- SIMILARITY: Contains 6 EGF-like domains. 

CC 

CC This SWISS-PROT entry is copyright. It is produced through a collaboration 

CC between the Swiss Institute of Bioinformatics and the EMBL outstation - 

CC the European Bioinformatics Institute. There are no restrictions on its 

CC use by non-profit institutions as long as its content is in no way 

CC modified and this statement is not removed. Usage by and for commercial 

CC entities requires a license agreement (See http://www.isb-sib.ch/announce/ 

CC or send an email to license@isb-sib.ch).

CC 

DR EMBL; AF104223; AAD45219.1; -. 

DR HSSP; P00736; 1APQ. 

DR MGD; MGI:1891209; Efemp2.

DR InterPro; IPR000152; Asx_hydroxyl.

DR InterPro; IPR001881; EGF_Ca.

DR InterPro; IPR006209; EGF_like.

DR InterPro; IPR001491; Thrmbomoduln.

DR Pfam; PF00008; EGF; 4.

DR PRINTS; PR00907; THRMBOMODULN.

DR SMART; SM00179; EGF_CA; 4.

DR PROSITE; PS00010; ASX_HYDROXYL; 4.

DR PROSITE; PS00022; EGF_1; FALSE_NEG.

DR PROSITE; PS01186; EGF_2; 4.

DR PROSITE; PS01187; EGF_CA; 6.



KW Repeat; EGF-like domain; Calcium-binding; Glycoprotein; Signal.

FT SIGNAL 1 25 POTENTIAL.

FT CHAIN 26 443 EGF-CONTAINING FIBULIN-LIKE EXTRACELLULAR

FT MATRIX PROTEIN 2.

FT DOMAIN 36 81 EGF-LIKE 1, DIVERGENT.

FT DOMAIN 123 163 EGF-LIKE 2, CALCIUM-BINDING (POTENTIAL).

FT DOMAIN 164 202 EGF-LIKE 3, CALCIUM-BINDING (POTENTIAL).

FT DOMAIN 203 242 EGF-LIKE 4, CALCIUM-BINDING (POTENTIAL).

FT DOMAIN 243 282 EGF-LIKE 5, CALCIUM-BINDING (POTENTIAL).

FT DOMAIN 283 328 EGF-LIKE 6, CALCIUM-BINDING (POTENTIAL).

FT DISULFID 127 140 BY SIMILARITY.

FT DISULFID 134 149 BY SIMILARITY.

FT DISULFID 151 162 BY SIMILARITY.

FT DISULFID 168 177 BY SIMILARITY.

FT DISULFID 173 186 BY SIMILARITY.

FT DISULFID 188 201 BY SIMILARITY.

FT DISULFID 207 217 BY SIMILARITY.

FT DISULFID 213 226 BY SIMILARITY.

FT DISULFID 228 241 BY SIMILARITY.

FT DISULFID 247 258 BY SIMILARITY.

FT DISULFID 254 267 BY SIMILARITY.

FT DISULFID 269 281 BY SIMILARITY.

FT DISULFID 287 300 BY SIMILARITY.

FT DISULFID 294 309 BY SIMILARITY.

FT DISULFID 315 327 BY SIMILARITY.

FT CARBOHYD 198 198 N-LINKED (GLCNAC...) (POTENTIAL).

FT CARBOHYD 394 394 N-LINKED (GLCNAC...) (POTENTIAL).

SQ SEQUENCE 443 AA; 49425 MW; 4969C0328A23DD88 CRC64;



Query Match 50.6%; Score 1282; DB 1; Length 443; 

Best Local Similarity 49.9%; Pred. No. 5.7e-88; 

Matches 226; Conservative 70; Mismatches 135; Indels 22; Gaps 4; 

Qy 1 MPGIKRILTVTILALCLPSPGNAQ-----AQCTNGFDLDRQSGQCLDIDECRTIPEACRG 55

= 11 : :| | || : : Hhl- I i I l-M IMMhl 

Db 8 LPGSLLLWAFLLLLLGAASPQDPEEPDSYTECTDGYEWDADSQHCRDVNECLTIPEACKG 67 

Qy 56 DMMCVNQNGGYLCIPRTNPVYRGPYSNPYSTPYSGPYPAAAPPLSAPNYPTISRPLICRF 115 

= 1 hi II llh lh i : II I II H I 

Db 68 EMKCINHYGGYLCLPRSAAVISDLHG EGPPPPAA HAQQPNPCPQ 111 

Qy 116 GYQMDESNQCVDVDECATDSHQCNPTQICINTEGGYTCSCTDGYWLLEGQCLDIDECRYG 175

lh I I IMIIII I I hi I I I I hi III : :hlllllll 

Db 112 GYEPDEQESCVDVDECTQALHDCRPSQDCHNLPGSYQCTCPDGYRKIGPECVDIDECRYR 171 

Qy 176 YCQQLCANVPGSYSCTCNPGFTLNEDGRSCQDVNECATENPCVQTCVNTYGSFICRCDPG 235

Ml I 1 = 111= I I IN I : III Mill II I E hlhhllh I 

Db 172 YCQHRCVNLPGSFRCQCEPGFQLGPNNRSCVDVNECDMGAPCEQRCFNSYGTFLCRCNQG 231 

Qy 236 YELEEDGVHCSDMDECSFSEFLCQHECVNQPGTYFCSCPPGYILLDDNRSCQDINECEHR 295 

III II Mhlll H I llhll : I || || || | I' l l 

Db 232 YELHRDGFSCSDIDECGYSSYLCQYRCVNEPGRFSCHCPQGYQLL-ATRLCQDIDECETG 290

Qy 296 NHTCNLQQTCYNLQGGFKCIDPIRCEEPYLRISDNRCMCPAENPGCRDQPFTILYRDMDV 355

I I I I I 1 I I I 1 h : : I I I I h I I I I I I h I | : h : | | : 

Db 291 AHQCSEAQTCVNFHGGYRCVDTNRCVEPYVQVSDNRCLCPASNPLCREQPSSIVHRYMSI 350 



Qy 356 VSGRSVPADIFQMQATTRYPGAYYIFQIKSGNEGREFYMRQTGPISATLVMTRPIKGPRE 415

I lllllhlhllh Mill llhlll ; ; : 1 1 I ! MM 

Db 351 TSERSVPADVFQIQATSVYPGAYNAFQIRSGNTQGDFYIRQINNVSAMLVLARPVTGPRE 410

Qy 416 IQLDLEMITVNTVINFRGSSVIRLRIYVSQYPF 448 

MMhhh::::| llhll =M I I 
Db 411 YVLDLEMVTMNSLMSYRASSVLRLTVFVGAYTF 443 



RESULT 7 
FBL3_RAT 

ID FBL3_RAT STANDARD; PRT; 493 AA. 

AC O35568;

DT 16-OCT-2001 (Rel. 40, Created)

DT 16-OCT-2001 (Rel. 40, Last sequence update)

DT 15-SEP-2003 (Rel. 42, Last annotation update)

DE EGF-containing fibulin-like extracellular matrix protein 1 precursor

DE (Fibulin-3) (FIBL-3) (T16 protein).

GN EFEMP1 OR FBLN3.

OS Rattus norvegicus (Rat).

OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;

OC Mammalia; Eutheria; Rodentia; Sciurognathi; Muridae; Murinae; Rattus.

OX NCBI_TaxID=10116; 

RN [1] 

RP SEQUENCE FROM N.A. 

RC TISSUE=Lung; 

RX MEDLINE=97415782; PubMed=9268694;

RA Ozaki T., Kondo K., Nakamura Y., Ichimiya S., Nakagawara A.,

RA Sakiyama S.;

RT "Interaction of DA41, a DAN-binding protein, with the epidermal growth

RT factor-like protein, S(1-5).";

RL Biochem. Biophys. Res. Commun. 237:245-250(1997). 

CC -!- SUBCELLULAR LOCATION: Secreted. 

CC -!- SIMILARITY: Belongs to the fibulin family. 

CC -!- SIMILARITY: Contains 6 EGF-like domains. 

CC 

CC This SWISS-PROT entry is copyright. It is produced through a collaboration 

CC between the Swiss Institute of Bioinformatics and the EMBL outstation -

CC the European Bioinformatics Institute. There are no restrictions on its 

CC use by non-profit institutions as long as its content is in no way 

CC modified and this statement is not removed. Usage by and for commercial 

CC entities requires a license agreement (See http://www.isb-sib.ch/announce/ 

CC or send an email to license@isb-sib.ch).

CC 

DR EMBL; D89730; BAA22265.1; 

DR PIR; JC5621; JC5621. 

DR HSSP; P35555; 1EMN. 

DR InterPro; IPR000152; Asx_hydroxyl.

DR InterPro; IPR001881; EGF_Ca.

DR InterPro; IPR006209; EGF_like.

DR Pfam; PF00008; EGF; 3.

DR SMART; SM00179; EGF_CA; 4.

DR PROSITE; PS00010; ASX_HYDROXYL; 4.

DR PROSITE; PS00022; EGF_1; FALSE_NEG.

DR PROSITE; PS01186; EGF_2; 4.

DR PROSITE; PS01187; EGF_CA; 6.

KW Repeat; EGF-like domain; Calcium-binding; Glycoprotein; Signal. 



FT SIGNAL 1 17 POTENTIAL.

FT CHAIN 18 493 EGF-CONTAINING FIBULIN-LIKE EXTRACELLULAR

FT MATRIX PROTEIN 1.

FT DOMAIN 26 71 EGF-LIKE 1, DIVERGENT.

FT DOMAIN 173 213 EGF-LIKE 2, CALCIUM-BINDING (POTENTIAL).

FT DOMAIN 214 253 EGF-LIKE 3, CALCIUM-BINDING (POTENTIAL).

FT DOMAIN 254 293 EGF-LIKE 4, CALCIUM-BINDING (POTENTIAL).

FT DOMAIN 294 333 EGF-LIKE 5, CALCIUM-BINDING (POTENTIAL).

FT DOMAIN 334 378 EGF-LIKE 6, CALCIUM-BINDING (POTENTIAL).

FT DISULFID 177 190 BY SIMILARITY.

FT DISULFID 184 199 BY SIMILARITY.

FT DISULFID 201 212 BY SIMILARITY.

FT DISULFID 218 228 BY SIMILARITY.

FT DISULFID 224 237 BY SIMILARITY.

FT DISULFID 239 252 BY SIMILARITY.

FT DISULFID 258 268 BY SIMILARITY.

FT DISULFID 264 277 BY SIMILARITY.

FT DISULFID 279 292 BY SIMILARITY.

FT DISULFID 298 309 BY SIMILARITY.

FT DISULFID 305 318 BY SIMILARITY.

FT DISULFID 320 332 BY SIMILARITY.

FT DISULFID 338 350 BY SIMILARITY.

FT DISULFID 344 359 BY SIMILARITY.

FT DISULFID 365 377 BY SIMILARITY.

SQ SEQUENCE 493 AA; 54596 MW; 22DAFD70BACF1CA5 CRC64;




Query Match 43.8%; Score 1109.5; DB 1; Length 493; 

Best Local Similarity 42.3%; Pred. No. 3.8e-75; 

Matches 206; Conservative 73; Mismatches 161; Indels 47; Gaps

Qy 7 ILTVTILALCLPSPGNAQAQCTNGFDLDRQSGQCLDIDECRTIPEACRGDMMCVNQNGGY 66

I : Mhl- I I! Mill :|:||:| | I , I

Db 9 MLTLALVKSQVTEETITYTQCTDGYEWDPVRQQCKDIDECDIVPDACKGGMKCVNHYGGY 68



Qy 67 LCI PRTNPVYRGPYSNPYSTPYS GPYPAAAPPLSAP 102 

MHH : II = | : |:| == [ 

Db 69 LCLPKTAQIIVNNEQPQQETPAAEASSGAATGTIAARSMATSGVIPGGGFIASATAVAGP 128

Qy 103 NYPT 1 SRPLI CRFGYQMDESNQCVDVDECATDSHQCNPTQ 142 

I I : I I I : III hill = : I I I 

Db 129 EVQTGRNNFVIRRNPADPQRIPSNPSHRIQCAAGYEQSEHNVCQDIDECTSGTHNCRLDQ 188

Qy 143 ICINTEGGYTCSCTDGYWLLEGQCLDIDECRY-GYCQQLCANVPGSYSCTCNPGFTLNED 201

= 111 I =11 I II I || | | | |||: | HIM | : 

Db 189 VCINLRGSFTCHCLPGYQKRGEQCVDIDECSVPPYCHQGCVNTPGSFYCQCNPGFQLAAN 248

Qy 202 GRSCQDVNECATENPCVQTCVNTYGSFICRCDPGYELEEDGVHCSDMDECSFSEFLCQHE 261 

= 1 1 = 111 I I I I I Illlhl: JEM I -I hill I :||h: 
Db 249 NYTCVDI NECDASNQCAQQC YN I LGS F I CQCNQGYELSSDRLNCED I DECRTS S YLCQYQ 308 

Qy 262 CVNQPGT YFCS C PPGY I LLDDNRS CQD I NECEHRNHTCNLQQTCYNLQGGF KC I D P I RCE 321 

Mhll : I II II ' T I | : h| \\ - \ I 

Db 309 CVNEPGKFSCMCPQGYQVV-RSRTCQDINECETTNE-CREDEMCWNYHGGFRCYPQNPCQ 366

Qy 322 EPYLRISDNRCMCPAENPGCRDQPFTILYRDMDVVSGRSVPADIFQMQATTRYPGAYYIF 381

Hh hllhll I III I =hh h= I lllhlilhllll I I 

Db 367 DPYVLTSENRCVCPVSNTMCRDVPQSIVYKYMNIRSDRSVPSDIFQIQATTIYANTINTF 426



Qy 382 QIKSGNEGREFYMRQTGPISATLVMTRPIKGPREIQLDLEMITVNTVINFRGSSVIRLRI 441 

'MINI H f : I If hi! ||: : : |||| : llhlh- II llh|| | 
Db 427 RIKSGNENGEFYLRQTSPVSAMLVLVKSLTGPREHIVGLEMLTVSSIGTFRTSSVLRLTI 486

Qy 442 YVSQYPF 448 

I : I ■ 
Db 487 IVGPFSF 493 



RESULT 8
FBL3_HUMAN

ID FBL3_HUMAN STANDARD; PRT; 493 AA.

AC Q12805;

DT 16-OCT-2001 (Rel. 40, Created)

DT 16-OCT-2001 (Rel. 40, Last sequence update)

DT 15-SEP-2003 (Rel. 42, Last annotation update)

DE EGF-containing fibulin-like extracellular matrix protein 1 precursor
DE (Fibulin-3) (FIBL-3) (Fibrillin-like protein) (Extracellular protein
DE S1-5).

GN EFEMP1 OR FBLN3 OR FBNL.

OS Homo sapiens (Human).

OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; 

OC Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. 

OX NCBI_TaxID=9606;
RN [1]

RP SEQUENCE FROM N.A., AND POSSIBLE ALTERNATIVE SPLICING.

RC TISSUE=Skin;

RX MEDLINE=95097983; PubMed=7799918;

RA Lecka-Czernik B., Lumpkin C.K. Jr., Goldstein S.;

RT "An overexpressed gene transcript in senescent and quiescent human 

RT fibroblasts encoding a novel protein in the epidermal growth factor- 

RT like repeat family stimulates DNA synthesis."; 

RL Mol. Cell. Biol. 15:120-128(1995). 
RN [2] 

RP SEQUENCE FROM N.A. 

RX MEDLINE=97001163; PubMed=8812496;

RA Ikegawa S., Toda T., Okui K., Nakamura Y.;

RT "Structure and chromosomal assignment of the human S1-5 gene (FBNL)

RT that is highly homologous to fibrillin."; 

RL Genomics 35:590-592(1996). 
RN [3] 

RP SEQUENCE FROM N.A. 

RX MEDLINE=20068041; PubMed=10601734;

RA Giltay R., Timpl R., Kostka G.;

RT "Sequence, recombinant expression and tissue localization of two novel 

RT extracellular matrix proteins, fibulin-3 and fibulin-4."; 

RL Matrix Biol. 18:469-480(1999). 

RN [4] 

RP VARIANT DHRD/MLVT TRP-345, AND VARIANT PHE-220.

RX MEDLINE=99295941; PubMed=10369267;

RA Stone E.M., Lotery A.J., Munier F.L., Heon E., Piguet B., Guymer R.H.,

RA Vandenburgh K., Cousin P., Nishimura D., Swiderski R.E., Silvestri G.,

RA Mackey D.A., Hagerman G.S., Bird A.C., Sheffield V.C.,

RA Schorderet D.F.; 

RT "A single EFEMP1 mutation associated with both malattia Leventinese 

RT and Doyne honeycomb retinal dystrophy."; 



RL Nat. Genet. 22:199-202(1999). 

CC -!- SUBCELLULAR LOCATION: Secreted. 

CC -!- ALTERNATIVE PRODUCTS: 

CC Event=Alternative splicing; Named isoforms=4;

CC Comment=Experimental confirmation may be lacking for some

CC isoforms;

CC Name=1;

CC IsoId=Q12805-1; Sequence=Displayed;

CC Name=2;

CC IsoId=Q12805-2; Sequence=VSP_001392;

CC Name=3;

CC IsoId=Q12805-3; Sequence=VSP_001393;

CC Name=4;

CC IsoId=Q12805-4; Sequence=VSP_001394;

CC -!- DISEASE: DEFECTS IN EFEMP1 ARE A CAUSE OF DOYNE HONEYCOMB RETINAL
CC DYSTROPHY (DHRD) ALSO KNOWN AS MALATTIA LEVENTINESE (MLVT OR ML),

CC AN AUTOSOMAL DOMINANT DISEASE CHARACTERIZED BY YELLOW-WHITE

CC DEPOSITS KNOWN AS DRUSEN THAT ACCUMULATE BENEATH THE RETINAL 

CC PIGMENT EPITHELIUM. 

CC -!- SIMILARITY: Belongs to the fibulin family. 

CC -!- SIMILARITY: Contains 6 EGF-like domains.

CC

CC This SWISS-PROT entry is copyright. It is produced through a collaboration

CC between the Swiss Institute of Bioinformatics and the EMBL outstation -

CC the European Bioinformatics Institute. There are no restrictions on its 

CC use by non-profit institutions as long as its content is in no way 

CC modified and this statement is not removed. Usage by and for commercial 

CC entities requires a license agreement (See http://www.isb-sib.ch/announce/ 

CC or send an email to license@isb-sib.ch).

CC 

DR EMBL; U03877; AAA65590.1; -. 

DR HSSP; P35555; 1EMN. 

DR Genew; HGNC:3218; EFEMP1.

DR MIM; 601548; -.

DR MIM; 126600; -.

DR GO; GO:0005578; C:extracellular matrix; TAS.

DR GO; GO:0007601; P:vision; TAS.

DR InterPro; IPR000152; Asx_hydroxyl.

DR InterPro; IPR001881; EGF_Ca.

DR InterPro; IPR006209; EGF_like.

DR Pfam; PF00008; EGF; 3.

DR SMART; SM00179; EGF_CA; 4.

DR PROSITE; PS00010; ASX_HYDROXYL; 4.

DR PROSITE; PS00022; EGF_1; FALSE_NEG.

DR PROSITE; PS01186; EGF_2; 4.

DR PROSITE; PS01187; EGF_CA; 6.

KW Repeat; EGF-like domain; Calcium-binding; Glycoprotein; Signal;

KW Disease mutation; Polymorphism; Alternative splicing.

FT SIGNAL 1 17 POTENTIAL.

FT CHAIN 18 493 EGF-CONTAINING FIBULIN-LIKE EXTRACELLULAR

FT MATRIX PROTEIN 1.

FT DOMAIN 26 71 EGF-LIKE 1, DIVERGENT.

FT DOMAIN 173 213 EGF-LIKE 2, CALCIUM-BINDING (POTENTIAL).

FT DOMAIN 214 253 EGF-LIKE 3, CALCIUM-BINDING (POTENTIAL).

FT DOMAIN 254 293 EGF-LIKE 4, CALCIUM-BINDING (POTENTIAL).

FT DOMAIN 294 333 EGF-LIKE 5, CALCIUM-BINDING (POTENTIAL).

FT DOMAIN 334 378 EGF-LIKE 6, CALCIUM-BINDING (POTENTIAL).



FT DISULFID 177 190 BY SIMILARITY.

FT DISULFID 184 199 BY SIMILARITY.

FT DISULFID 201 212 BY SIMILARITY.

FT DISULFID 218 228 BY SIMILARITY.

FT DISULFID 224 237 BY SIMILARITY.

FT DISULFID 239 252 BY SIMILARITY.

FT DISULFID 258 268 BY SIMILARITY.

FT DISULFID 264 277 BY SIMILARITY.

FT DISULFID 279 292 BY SIMILARITY.

FT DISULFID 298 309 BY SIMILARITY.

FT DISULFID 305 318 BY SIMILARITY.

FT DISULFID 320 332 BY SIMILARITY.

FT DISULFID 338 350 BY SIMILARITY.

FT DISULFID 344 359 BY SIMILARITY.

FT DISULFID 365 377 BY SIMILARITY.

FT CARBOHYD 249 249 N-LINKED (GLCNAC...) (POTENTIAL).

FT VARSPLIC 1 8 Missing (in isoform 2).

FT /FTId=VSP_001392.

FT VARSPLIC 58 58 Missing (in isoform 3).

FT /FTId=VSP_001393.

FT VARSPLIC 106 106 Missing (in isoform 4).

FT /FTId=VSP_001394.

FT VARIANT 220 220 I -> F.

FT /FTId=VAR_009512.

FT VARIANT 345 345 R -> W (IN MVLT).

FT /FTId=VAR_009513.

SQ SEQUENCE 493 AA; 54640 MW; 128CA5ED140DF414 CRC64;



Query Match 43.5%; Score 1101; DB 1; Length 493;

Best Local Similarity 42.0%; Pred. No. 1.6e-74; 

Matches 208; Conservative 69; Mismatches 162; Indels 56; Gaps 8; 

Qy 6 RILTVTILALCLPSPGNAQ-----AQCTNGFDLDRQSGQCLDIDECRTIPEACRGDMMCV 60

: I :|:| I I = : I I I II Mill :|>||:| I II 

Db 3 KALFLTMLTLALVKSQDTEETITYTQCTDGYEWDPVRQQCKDIDECDIVPDACKGGMKCV 62 

Qy 61 NQNGGYLCI PRTNPVY RGPYSNPY STPYSGPYPAA 95 

I r -i i i i ill 

Db 63 NHYGGYLCLPKTAQIIVNNEQPQQETQPAEGTSGATTGVVAASSMATSGVLPGGGFVASA 122

Qy 96 APPLSAPNYPTI SRPLI CRFGYQMDESNQCVDVDECATD 134 

II hi I : I I h III hill 
Db 123 AAVAGPEMQTGRNNFVIRRNPADPQRIPSNP--SHRIQCAAGYEQSEHNVCQDIDECTAG 180

Qy 135 SHQCNPTQICINTEGGYTCSCTDGYWLLEGQCLDIDECRY-GYCQQLCANVPGSYSCTCN 193

:| I hill I : I I II I I ■ I I lllll Mh I I 

Db 181 THNCRADQVCINLRGSFACQCPPGYQKRGEQCVDIDECTIPPYCHQRCVNTPGSFYCQCS 240 

Qy 194 PGFTLNEDGRSCQDVNECATENPCVQTCVNTYGSFICRCDPGYELEEDGVHCSDMDECSF 253 

. Ill I ; :E hill lllll llllhh 1 1 1 1 I : hill 

Db 241 PGFQLAANNYTCVDINECDASNQCAQQCYNILGSFICQCNQGYELSSDRLNCEDIDECRT 300 

Qy 254 SEFLCQHECVNQPGTYFCSCPPGYILLDDNRSCQDINECEHRNHTCNLQQTCYNLQGGFK 313

I :|||::|Eh|| : | || || :: - Ml II : hi Mh 
Db 301 SSYLCQYQCVNEPGKFSCMCPQGYQVV-RSRTCQDINECETTNE-CREDEMCWNYHGGFR 358

Qy 314 CIDPIRCEEPYLRISDNRCMCPAENPGCRDQPFTILYRDMDVVSGRSVPADIFQMQATTR 373



I Ih III: | ||: | :|:|: | : | | | | | : | | | | = | | | | 

Db 359 CYPRNPCQDPYILTPENRCVCPVSNAMCRELPQSIVYKYMSIRSDRSVPSDIFQIQATTI 418 

Qy 374 YPGAYYIFQIKSGNEGREFYMRQTGPISATLVMTRPIKGPREIQLDLEMITVNTVINFRG 433

! hllllll llhlll I |h : : I I I I :M M - || 

Db 419 YANTINTFRIKSGNENGEFYLRQTSPVSAMLVLVKSLSGPREHIVDLEMLTVSSIGTFRT 478

Qy 434 SSVIRLRIYVSQYPF 448 

Ilhll I I : I 
Db 479 SSVLRLTIIVGPFSF 493 



RESULT 9 
FBL2_MOUSE

ID FBL2_MOUSE STANDARD; PRT; 1221 AA.

AC P37889; Q9WUI2;

DT 01-OCT-1994 (Rel. 30, Created)

DT 01-OCT-1994 (Rel. 30, Last sequence update)

DT 15-SEP-2003 (Rel. 42, Last annotation update)

DE Fibulin-2 precursor.

GN FBLN2.

OS Mus musculus (Mouse).

OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;

OC Mammalia; Eutheria; Rodentia; Sciurognathi; Muridae; Murinae; Mus.

OX NCBI_TaxID=10090; 
RN [1] 

RP SEQUENCE FROM N.A., AND SEQUENCE OF 27-35. 

RC TISSUE=Fibroblast; 

RX MEDLINE=94064787; PubMed=8245130;

RA Pan T.-C., Sasaki T., Zhang R.-Z., Faessler R., Timpl R., Chu M.-L.;

RT "Structure and expression of fibulin-2, a novel extracellular matrix

RT protein with multiple EGF-like repeats and consensus motifs for

RT calcium binding.";

RL J. Cell Biol. 123:1269-1277(1993). 

RN [2] 

RP SEQUENCE FROM N.A., AND ALTERNATIVE SPLICING.

RX MEDLINE=99337686; PubMed=10406956;

RA Graessel S., Sicot F.-X., Gotta S., Chu M.-L.;

RT "Mouse fibulin-2 gene. Complete exon-intron organization and promoter

RT characterization.";

RL Eur. J. Biochem. 263:471-477(1999). 

RN [3] 

RP DEVELOPMENTAL STAGE.

RX PubMed=8850569;

RA Zhang H.-Y., Timpl R., Sasaki T., Chu M.-L., Ekblom P.;

RT "Fibulin-1 and fibulin-2 expression during organogenesis in the

RT developing mouse embryo.";

RL Dev. Dyn. 205:348-364(1996).

RN [4]

RP BINDING TO LAMA2.

RX PubMed=10022829;

RA Talts J.F., Andac Z., Goehring W., Brancaccio A., Timpl R.;

RT "Binding of the G domains of laminin alpha1 and alpha2 chains and

RT perlecan to heparin, sulfatides, alpha-dystroglycan and several

RT extracellular matrix proteins."; 

RL EMBO J. 18:863-870(1999). 

RN [5] 



RP DOWN-REGULATION BY GLUCOCORTICOIDS. 

RX PubMed=11737251;

RA Gu Y.-C., Talts J.F., Gullberg D., Timpl R., Ekblom M.;

RT "Glucocorticoids down-regulate the extracellular matrix proteins 

RT fibronectin, fibulin-1 and fibulin-2 in bone marrow stroma."; 

RL Eur. J. Haematol. 67:176-184(2001). 

CC -!- FUNCTION: ITS BINDING TO FIBRONECTIN AND SOME OTHER LIGANDS IS
CC CALCIUM DEPENDENT.

CC -!- SUBUNIT: Homotrimer; disulfide-linked. Interacts with LAMA2.

CC -!- SUBCELLULAR LOCATION: Secreted; extracellular matrix.

CC -!- ALTERNATIVE PRODUCTS:

CC Event=Alternative splicing; Named isoforms=2;

CC Comment=Additional isoforms seem to exist;

CC Name=1;

CC IsoId=P37889-1; Sequence=Displayed;

CC Name=2; Synonyms=EGF3-less;

CC IsoId=P37889-2; Sequence=VSP_001391;

CC -!- TISSUE SPECIFICITY: Component of both basement membranes and other
CC connective tissues.

CC -!- DEVELOPMENTAL STAGE: The differential expression of the fibulin

CC family contributes to the formation of molecularly distinct 

CC extracellular matrices already during early developmental stages 

CC of a large number of tissues. 

CC -!- INDUCTION: Glucocorticoids suppressed mRNA expression and protein 
CC synthesis.

CC -!- SIMILARITY: Belongs to the fibulin family. 

CC -!- SIMILARITY: Contains 3 anaphylatoxin-like domains. 

CC -!- SIMILARITY: Contains 11 EGF-like domains. 



CC 

CC This SWISS-PROT entry is copyright. It is produced through a collaboration 

CC between the Swiss Institute of Bioinformatics and the EMBL outstation - 

CC the European Bioinformatics Institute. There are no restrictions on its 

CC use by non-profit institutions as long as its content is in no way 

CC modified and this statement is not removed. Usage by and for commercial 

CC entities requires a license agreement (See http://www.isb-sib.ch/announce/ 

CC or send an email to license@isb-sib.ch) . 



DR EMBL; X75285; CAA53040.1;

DR EMBL; AF135253; AAD34456.1;

DR EMBL; AF135239; AAD34456.1; JOINED.

DR EMBL; AF135240; AAD34456.1; JOINED.

DR EMBL; AF135241; AAD34456.1; JOINED.

DR EMBL; AF135242; AAD34456.1; JOINED.

DR EMBL; AF135243; AAD34456.1; JOINED.

DR EMBL; AF135244; AAD34456.1; JOINED.

DR EMBL; AF135245; AAD34456.1; JOINED.

DR EMBL; AF135246; AAD34456.1; JOINED.

DR EMBL; AF135247; AAD34456.1; JOINED.

DR EMBL; AF135248; AAD34456.1; JOINED.

DR EMBL; AF135249; AAD34456.1; JOINED.

DR EMBL; AF135250; AAD34456.1; JOINED.

DR EMBL; AF135251; AAD34456.1; JOINED.

DR EMBL; AF135252; AAD34456.1; JOINED.



DR PIR; A49457; A49457. 

DR HSSP; P00736; 1APQ. 

DR MGD; MGI:95488; Fbln2.

DR InterPro; IPR000020; Anaphylatoxin.

DR InterPro; IPR000152; Asx_hydroxyl.

DR InterPro; IPR001881; EGF_Ca.

DR InterPro; IPR006209; EGF_like.

DR Pfam; PF01821; ANATO; 2.

DR Pfam; PF00008; EGF; 6.

DR SMART; SM00104; ANATO; 3.

DR SMART; SM00179; EGF_CA; 9.

DR PROSITE; PS00010; ASX_HYDROXYL; 5.

DR PROSITE; PS01177; ANAPHYLATOXIN_1; 3.

DR PROSITE; PS01178; ANAPHYLATOXIN_2; 3.

DR PROSITE; PS00022; EGF_1; FALSE_NEG.

DR PROSITE; PS01186; EGF_2; 5.

DR PROSITE; PS01187; EGF_CA; 10.

KW Signal; Glycoprotein; Extracellular matrix; Plasma; EGF-like domain;

KW Calcium-binding; Alternative splicing; Repeat.

FT SIGNAL 1 26

FT CHAIN 27 1221
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SQ 


SEQUENCE 1221 AA; 131818 MW; 87DB2A10A8FDC45F CRC64;

Query Match 29.0%; Score 733.5; DB 1; Length 1221;

Best Local Similarity 35.3%; Pred. No. 6.3e-47;

Matches 146; Conservative 60; Mismatches 149; Indels 59; Gaps



Qy 23 AQAQCTNGFDLDRQSGQCLDIDECRTIPEACRGDMMCVNQNGGYLCIPRTNPVYRGPYSN 82

h I >ll I I HI I :: I hi III 
Db 829 ARQRCMDGF-LQDPEGNCVDINECTSLLEPCRSGFSCINTVGSYTC 873 

Qy 83 PYSTPYSGPYPAAAPPLSAPNYPTISRPLICRFGYQM-DESNQCVDVDECATDSHQCNPT 141

Ihl II : I -Nihil I hi 

Db 874 QRNPLVCGRGYHANEEGSECVDVNECETGVHRCGEG 909 

Qy 142 Q I C I NTEGGYTCS CTDGYW - - LLEGQCLD I DECR YG YCQQLCANVPGS YS CTCN PGF 196 

hi I I I I I h hhNI || | | Nil hi II 

Db 910 QLCYNLPGSYRCDCKPGFQRDAFGRTCIDVNECWVSPGRLCQHTCENTPGSYRCSCAAGF 969

Qy 197 TLNEDGRSCQDVNECATENPCVQTCVNTYGSFICRCDPGYELEEDGVHCSDMDECS-FSE 255 

i I hjllll I MM Nh I I Eh! III hhllh ; . 
Db 970 LLAADGKHCEDVNECETRR-CSQECANIYGSYQCYCRQGYQLAEDGHTCTDIDECAQGAG 1028 

Qy 256 FLCQHECVNQPGTYFCSCP-PGYILLDDNRSCQDINECEHRNHTCNLQQTCYNLQGGFKC 314

N IN Ihl I II : , | I Nhhll hi 

Db 1029 ILCTFRCVNVPGSYQCACPEQGYTMMANGRSCKDLDECALGTHNCSEAETCHNIQGSFRC 1088

Qy 315 IDPIRCEEPYLRISDNRCMCPAENPGCRD QPFTILYRDMDWSGRSVPADI FQM 368 

■ I hhl N I hi I I = :: =| ||[ II- 

Db 1089 L-RFDCPPNYVRVSQTKC ERTTCQDI TECQTSPARI THYQLNFQTGLLVPAHI FRI 1143 

Qy 369 QATTRYPGAY Y I FQI KSGNEGREF YMRQTGP I SATLVMTRP I KGPRE I QLDLEM 422 

M I III I h = : M = Ih 1 E I I 



Db 1144 GPAPAFAGDTI SLTI TKGNEEGYFVTRRLNAYTGWSLQRSVXtEPRDFALDVEM 1197 



RESULT 10 
FBL1_CERAE 



ID FBL1_CERAE STANDARD; PRT; 598 AA. 

AC Q8MJJ9 ; 

DT 15-SEP-2003 (Rel. 42, Created) 

DT 15-SEP-2003 (Rel. 42, Last sequence update)

DT 15-SEP-2003 (Rel. 42, Last annotation update)

DE Fibulin-1C (Fragment).

GN FBLN1.

OS Cercopithecus aethiops (Green monkey) (Grivet).

OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;

OC Mammalia; Eutheria; Primates; Catarrhini; Cercopithecidae;

OC Cercopithecinae; Cercopithecus.

OX NCBI_TaxID=9534;

RN [1] 

RP SEQUENCE FROM N.A., AND INTERACTION WITH DTR.

RX PubMed=11846885;

RA Brooke J.S., Cha J.-H., Eidels L.;

RT "Latent transforming growth factor beta-binding protein-3 and

RT fibulin-1C interact with the extracellular domain of the

RT heparin-binding EGF-like growth factor precursor."; 

RL BMC Cell Biol. 3:2-2(2002). 

CC -!- FUNCTION: Incorporated into fibronectin-containing matrix fibers.

CC May play a role in cell adhesion and migration along protein

CC fibers within the extracellular matrix (ECM). Could be important

CC for certain developmental processes and contribute to the

CC supramolecular organization of ECM architecture, in particular to

CC those of basement membranes. May serve to anchor the

CC mature/soluble form of DTR to its fibers as it migrates through

CC the extracellular matrix. The direct physical association with DTR

CC may be useful in such tissue developmental processes as wound

CC healing.

CC -!- SUBUNIT: Interacts with itself and with various extracellular
CC matrix components (By similarity). Interacts with the

CC mature/soluble form of DTR.

CC -!- SUBCELLULAR LOCATION: Secreted; extracellular matrix.

CC -!- SIMILARITY: Belongs to the fibulin family.

CC -!- SIMILARITY: Contains at least 2 anaphylatoxin-like domains.

CC -!- SIMILARITY: Contains 9 EGF-like domains. 

CC 

CC This SWISS-PROT entry is copyright. It is produced through a collaboration 

CC between the Swiss Institute of Bioinformatics and the EMBL outstation - 

CC the European Bioinformatics Institute. There are no restrictions on its

CC use by non-profit institutions as long as its content is in no way

CC modified and this statement is not removed. Usage by and for commercial

CC entities requires a license agreement (See http://www.isb-sib.ch/announce/

CC or send an email to license@isb-sib.ch).

CC

DR EMBL; AF395659; AAM90567.1; -.

DR InterPro; IPR000020; Anaphylatoxin.

DR InterPro; IPR000152; Asx_hydroxyl . 

DR InterPro; IPR001881; EGF_Ca. 

DR InterPro; IPR006209; EGF_like. 

DR InterPro; IPR006210; IEGF. 



DR Pfam; PF01821; ANATO; 1.

DR Pfam; PF00008; EGF; 5.

DR SMART; SM00104; ANATO; 1.

DR SMART; SM00181; EGF; 9.

DR SMART; SM00179; EGF_CA; 9.

DR PROSITE; PS01177; ANAPHYLATOXIN_1; 1.

DR PROSITE; PS01178; ANAPHYLATOXIN_2; 1.

DR PROSITE; PS00010; ASX_HYDROXYL; 4.

DR PROSITE; PS01186; EGF_2; 3.

DR PROSITE; PS01187; EGF_CA; 7.

KW Glycoprotein; Extracellular matrix; Repeat; EGF-like domain;

KW Calcium-binding.
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FT DISULFID 471 4 83 BY SIMILARITY. 

FT CARBOHYD 14 14 N-LINKED (GLCNAC...) (POTENTIAL).

FT CARBOHYD 450 450 N-LINKED (GLCNAC...) (POTENTIAL).

FT CARBOHYD 454 454 N-LINKED (GLCNAC...) (POTENTIAL).

SQ SEQUENCE 598 AA; 65516 MW; 849BF018DF452B02 CRC64;

Query Match 28.7%; Score 728; DB 1; Length 598; 

Best Local Similarity 34.8%; Pred. No. 8e-47; 

Matches 158; Conservative 76; Mismatches 188; Indels 32; Gaps 16; 

Qy 16 CLPSPGNAQAQ CTNGFDLDRQSGQCLDIDECRTI PEACRGDMMCVNQNGGYLCI PR 71 

I = : h : I i 1-1 : I MM = I I = | I I = | h 

Db 149 CINTVGSFRCQRDSSCGTGYEL-TEDNSCKDIDQCESGIHNCLPDFICQNTLGSFRCRPK 207

Qy 72 TNPVYRGPYSNPYS TPYSGPYPAAAPPLSAPNYPTI SRPLI CRFGYQMDESN 123 

I = =1 =111 :: I I M ::| 

Db 208 LQCKNGFIQDALANCIDINECLSIVSAPCPTGHTCINTEGSYTQKNVPNCGRGYHLNEEG 267

Qy 124 QCVDVDECATDSHQCNPTQICINTEGGYTCSCTDGYWL--LEGQCLDIDEC-RY--GYCQ 178

11 = 111 = I 1 = 1 = I : I i |h = hh:|| li ! 

Db 268 TRCDVNECAPPAEPCGKGHRCVNSPGSFRCECKTGYYFDGISRMCVDVNECQRYPGRLCG 327

Qy 179 QLCANVPGSYSCTCNPGFTLNEDGRSCQDVNECATENPCVQTCVNTYGSFICRCDPGYEL 238 

1 I I I I 1 = 1 = j| h IIIIMMM :|| I I I 111 = I I ||:| 
Db 328 HKCENTLGSYVCSCSVGFRLSVDGRSCEDINECSS-SPCSQECANVYGSYQCYCRRGYQL 386

Qy 239 EE-DGVHCSDMDECSF--SEFLCQHECVNQPGTYFCSCP-PGYILLDDNRSCQDINECEH 294

= III I 1=111= =1 = hi ll== III! II I = hilihll 

Db 387 SDVDGVTCEDIDECALPTGGHICSYRCINIPGSFQCSCPASGYRLAPNGRNCQDIDECVT 446 

Qy 295 RNHTCNLQQTCYNLQGGFKCIDPIRCEEPYLRISDNRC-MCPA-ENPGCRDQPFTILYRD 352 

I h= =ll = l = lllh|: 1111=11 I || | i | | 

Db 447 GIHNCSINETCFNIQGGFRCL-AFECPENYRRSAATRCERLPCHENRECSKLPLRITYYH 505 

Qy 353 MDVVSGRSVPADIFQMQATTRYPGAYYIFQIKSGNEGREFYMRQTGPISATLVMTRPIKG 412

: : II =1=1 == II I III I 1= ! I : :|:h 

Db 506 LSFPTNIQAPAVVFRMGPSSAVPGDSMQLAITGGNEEGFFTTRKVSPHSGVVALTKPVPE 565

Qy 413 PREIQLDLEM--ITVNTVINFRGSSVIRLRIYVS 444

II = = I = = l II =1 I =1 Ml 

Db 566 PRDLLLTVKMDLYRHGTVSSF VAKLFIFVS 595 

RESULT 11 
FBL1_CHICK 

ID FBL1_CHICK STANDARD; PRT; 704 AA. 

AC O73775; O73774;

DT 16-OCT-2001 (Rel. 40, Created)

DT 15-SEP-2003 (Rel. 42, Last sequence update)

DT 15-SEP-2003 (Rel. 42, Last annotation update)

DE Fibulin-1 precursor.

GN FBLN1.

OS Gallus gallus (Chicken).

OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; 

OC Archosauria; Aves; Neognathae; Galliformes; Phasianidae; Phasianinae; 

OC Gallus. 

OX NCBI_TaxID=9031;

RN [1]

RP SEQUENCE FROM N.A. (ISOFORMS C AND D).

RC TISSUE=Embryo;

RX MEDLINE=99120531; PubMed=9923656;

RA Barth J.L., Argraves K.M., Roark E.F., Little C.D., Argraves W.S.;

RT "Identification of chicken and C. elegans fibulin-1 homologs and 



RT characterization of the C. elegans fibulin-1 gene."; 

RL Matrix Biol. 17:635-646(1998). 

CC -!- FUNCTION: Incorporated into fibronectin-containing matrix fibers.

CC May play a role in cell adhesion and migration along protein

CC fibers within the extracellular matrix (ECM). Could be important

CC for certain developmental processes and contribute to the

CC supramolecular organization of ECM architecture, in particular to

CC those of basement membranes.

CC -!- SUBUNIT: Interacts with itself and with various extracellular
CC matrix components (By similarity).

CC -!- SUBCELLULAR LOCATION: Secreted; extracellular matrix.

CC -!- ALTERNATIVE PRODUCTS: 

CC Event=Alternative splicing; Named isoforms=2; 

CC Name=D; 

CC IsoId=O73775-2; Sequence=Displayed;

CC Name=C;

CC IsoId=O73775-1; Sequence=VSP_007378;

CC -!- SIMILARITY: Belongs to the fibulin family. 

CC -!- SIMILARITY: Contains 3 anaphylatoxin-like domains. 

CC -!- SIMILARITY: Contains 9 EGF-like domains. 

CC 

CC This SWISS-PROT entry is copyright. It is produced through a collaboration 

CC between the Swiss Institute of Bioinformatics and the EMBL outstation - 

CC the European Bioinformatics Institute. There are no restrictions on its 

CC use by non-profit institutions as long as its content is in no way 

CC modified and this statement is not removed. Usage by and for commercial 

CC entities requires a license agreement (See http://www.isb-sib.ch/announce/ 

CC or send an email to license@isb-sib.ch).



CC 



DR EMBL; AF051399; AAC05387.1; -. 

DR EMBL; AF051400; AAC05388.1; -.

DR HSSP; P00742; 1HCG.

DR InterPro; IPR000020; Anaphylatoxin.

DR InterPro; IPR000152; Asx_hydroxyl.

DR InterPro; IPR001881; EGF_Ca.

DR InterPro; IPR006209; EGF_like. 



DR Pfam; PF01821; ANATO; 2. 

DR Pfam; PF00008; EGF; 6. 

DR SMART; SM00104; ANATO; 3. 

DR SMART; SM00181; EGF; 9. 

DR SMART; SM00179; EGF_CA; 8. 

DR PROSITE; PS00010; ASX_HYDROXYL; 5.

DR PROSITE; PS01177; ANAPHYLATOXIN_1; 1.

DR PROSITE; PS01178; ANAPHYLATOXIN_2; 2.

DR PROSITE; PS00022; EGF_1; FALSE_NEG.

DR PROSITE; PS01186; EGF_2; 3. 

DR PROSITE; PS01187; EGF_CA; 8. 

KW Signal; Alternative splicing; Glycoprotein; Extracellular matrix;

KW Repeat; EGF-like domain; Calcium-binding.

280 BY SIMILARITY.


FT   DISULFID    274    289       BY SIMILARITY.
FT   DISULFID    295    307       BY SIMILARITY.
FT   DISULFID    313    326       BY SIMILARITY.
FT   DISULFID    320    335       BY SIMILARITY.
FT   DISULFID    342    355       BY SIMILARITY.
FT   DISULFID    361    374       BY SIMILARITY.
FT   DISULFID    368    383       BY SIMILARITY.
FT   DISULFID    385    398       BY SIMILARITY.
FT   DISULFID    404    416       BY SIMILARITY.
FT   DISULFID    412    425       BY SIMILARITY.
FT   DISULFID    427    440       BY SIMILARITY.
FT   DISULFID    446    455       BY SIMILARITY.
FT   DISULFID    451    464       BY SIMILARITY.
FT   DISULFID    466    480       BY SIMILARITY.
FT   DISULFID    486    499       BY SIMILARITY.
FT   DISULFID    495    508       BY SIMILARITY.
FT   DISULFID    510    524       BY SIMILARITY.
FT   DISULFID    530    543       BY SIMILARITY.
FT   DISULFID    537    552       BY SIMILARITY.
FT   DISULFID    557    578       BY SIMILARITY.
FT   CARBOHYD     96     96       N-LINKED (GLCNAC...) (POTENTIAL).
FT   CARBOHYD    536    536       N-LINKED (GLCNAC...) (POTENTIAL).
FT   CARBOHYD    540    540       N-LINKED (GLCNAC...) (POTENTIAL).
FT   VARSPLIC    568    704       VRLEKTDTIRCIKSCRPNDVNCVLDPVHTISHTVISLPTFR
FT                                EFTRPEEIIFLRAITPTYPANQADIIFDITEGNLRESFDII
FT                                KRYMDGMTVGVVRQVRPIVGPFHAILKLEMNYVMGGVVSHR
FT                                NIVNVHIFVSEYWF -> RCERLPCNENKECQSLPLRITYY
FT                                HLSFPTNIQVPTDIFRMGPSNAVPGDKILLSIISGNQEGFF
FT                                TTKKVNNHSGIWMQRQITEPRDLLLTIQMQLTRHGTVNTF
FT                                IAKLFVFVSAQL (in isoform C).
FT                                /FTId=VSP_007378.

SQ SEQUENCE 704 AA; 78137 MW; D47D5A30D5E42932 CRC64;

Query Match 28.5%; Score 722; DB 1; Length 704; 

Best Local Similarity 34.6%; Pred. No. 2.6e-46; 

Matches 163; Conservative 68; Mismatches 158; Indels 82; Gaps 22; 

Qy 15 LCLPSPGN----AQAQCTNGFDLDRQSGQCLDIDECRTIPEACRGDMMCVNQNGGYLCIP 70

= 1 Hh = II III I I hlhll = I =hl :| I I 

Db 279 ICQNTPGSFRCRPKLQCMNGFIQD-ALGNCIDINECLSTNMPCPAGQICINTDGSYTC-Q 336

Qy 71 RTNPVYRGPYSNPYSTPYSGPYPAAAPPLSAPNYPTISRPLICRFGYQMDE-SNQCVDVD 129 

I III ; :||||1 

Db 337 RISP SCGRGYHLNEDGTRCVDVD 359 

Qy 130 ECATDSHQCNPTQICINTEGGYTCSCTDGYW--LLEGQCLDIDECRY---GYCQQLCANV 184

I' I II' I I I I II Nlhlll j j | 

Db 360 ECSSSDQPCGEGHVCINGPGNYRCECKSGYSFDVISRTCIDINECRRYPGRLCAHKCENT 419

Qy 185 PGSYSCTCNPGFTLNEDGRSCQDVNECATENPCVQTCVNTYGSFICRCDPGYELEE-DGV 243

MM III II h llllhhlll = =11111 llh I I h:| : lh 
Db 420 PGSYYCTCTMGFKLSSDGRSCEDLNECES-SPCSQECANVYGSYQCYCRRGFQLSDIDGI 478 

Qy 244 HCSDMDECSF--SEFLCQHECVNQPGTYFCSCP-PGYILLDDNRSCQDINECEHRNHTCN 300 

I ' I I lh: hll j| | : .1 i' l. | I 

Db 479 SCEDIDECALPTGGHICSFRCINIPGSFQCTCPSTGYRLAPNARNCQDIDECVAETHNCS 538 

Qy 301 LQQTCYNLQGGFKCI DPI RCE E P YLR I SDN RCM- -CPAENPGC-RDQPFTI 348 

1 I : h 1 I I I : I : : I 1 I = I 11=1 = I I || 

Db 539 FNETCFNIQGGFRCLS-LECPENYRKSGDTVRLEKTDTIRCIKSCRPNDVNCVLDPVHTI 597

Qy 349 LYRDMDWSGRSV--PADIFQMQA-TTRYPG--AYYIFQIKSGNEGREFYMRQTG 398

■ ■ \ I :| ::| I II I II I II I III 

Db 598 SHTVISLPTFREFTRPEEI IFLRAITPTYPANQADI IFDITEGNLRESFDI IKRYM- -DG 655 

Qy 399 PISATLVMTRPIKGPREIQLDLEM-ITVNTVINFRGSSVIRLRIYVSQYPF 448

= III II I III : l = = I = hlH I 

Db 656 MTVGVVRQVRPIVGPFHAILKLEMNYVMGGVVSHR--NIVNVHIFVSEYWF 704
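
The summary percentages reported for each hit appear to follow directly from the counts printed beside them. The short Python sketch below is not part of the search program's output; it assumes that "Query Match" is the hit score divided by the query's perfect score (2533 in the report header) and that "Best Local Similarity" is the number of identical columns divided by all alignment columns (Matches + Conservative + Mismatches + Indels). Both formulas are inferred from the reported numbers, and the function names are hypothetical.

```python
# Perfect score for the query, as printed in the report header.
PERFECT_SCORE = 2533


def query_match(score, perfect_score=PERFECT_SCORE):
    """Hit score as a percentage of the query's perfect (self) score.

    Assumed definition, inferred from the reported values.
    """
    return 100.0 * score / perfect_score


def best_local_similarity(matches, conservative, mismatches, indels):
    """Identical columns as a percentage of all alignment columns.

    Assumed definition, inferred from the reported values.
    """
    columns = matches + conservative + mismatches + indels
    return 100.0 * matches / columns


# Counts for RESULT 11 (FBL1_CHICK), copied from its summary lines.
print(round(query_match(722), 1))
print(round(best_local_similarity(163, 68, 158, 82), 1))
```

Under these assumptions the two printed values agree with the 28.5% and 34.6% reported for RESULT 11, and the same functions give the figures reported for the other hits in this listing.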

RESULT 12 
FBL2_HUMAN 

ID FBL2_HUMAN STANDARD; PRT; 1184 AA. 

AC P98095; 

DT 01-NOV-1995 (Rel. 32, Created) 

DT 01-NOV-1995 (Rel. 32, Last sequence update) 

DT 15-SEP-2003 (Rel. 42, Last annotation update)

DE Fibulin-2 precursor.

GN FBLN2.

OS Homo sapiens (Human).

OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; 

OC Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. 

OX NCBI_TaxID=9606; 
RN [1] 

RP SEQUENCE FROM N.A. 



RC TISSUE=Fibroblast; 

RX MEDLINE=95104855; PubMed=7806230;

RA Zhang R.-Z., Pan T.-C., Zhang Z.-Y., Mattei M.-G., Timpl R.,

RA Chu M.-L.;

RT "Fibulin-2 (FBLN2): human cDNA sequence, mRNA expression, and mapping

RT of the gene on human and mouse chromosomes.";

RL Genomics 22:425-430(1994).

RN [2]

RP DEVELOPMENTAL STAGE.

RX PubMed=8737292;

RA Miosge N., Gotz W., Sasaki T., Chu M.-L., Timpl R., Herken R.;

RT "The extracellular matrix proteins fibulin-1 and fibulin-2 in the

RT early human embryo.";

RL Histochem. J. 28:109-116(1996).

CC -!- FUNCTION: ITS BINDING TO FIBRONECTIN AND SOME OTHER LIGANDS IS 
CC CALCIUM DEPENDENT. 

CC -!- SUBUNIT: Homotrimer; disulfide-linked. Interacts with LAMA2 (By
CC similarity).

CC -!- SUBCELLULAR LOCATION: Secreted; extracellular matrix.

CC -!- TISSUE SPECIFICITY: Component of both basement membranes and other

CC connective tissues. Expressed in heart, placenta and ovary.

CC -!- DEVELOPMENTAL STAGE: Widely expressed during embryonic

CC development. Primarily detected within the neuroepithelium, spinal

CC ganglia and peripheral nerves.

CC -!- SIMILARITY: Belongs to the fibulin family.

CC -!- SIMILARITY: Contains 3 anaphylatoxin-like domains.

CC -!- SIMILARITY: Contains 11 EGF-like domains.

CC 

CC This SWISS-PROT entry is copyright. It is produced through a collaboration 

CC between the Swiss Institute of Bioinformatics and the EMBL outstation - 

CC the European Bioinformatics Institute. There are no restrictions on its 

CC use by non-profit institutions as long as its content is in no way 

CC modified and this statement is not removed. Usage by and for commercial 

CC entities requires a license agreement (See http://www.isb-sib.ch/announce/ 

CC or send an email to license@isb-sib.ch).

CC

DR EMBL; X82494; CAA57876.1; -.

DR PIR; A55184; A55184.

DR HSSP; P00736; 1APQ.

DR Genew; HGNC:3601; FBLN2.

DR MIM; 135821; -.

DR GO; GO:0005578; C:extracellular matrix; TAS.

DR GO; GO:0005509; F:calcium ion binding activity; TAS.

DR GO; GO:0005207; F:extracellular matrix glycoprotein; TAS.

DR InterPro; IPR000020; Anaphylatoxin.

DR InterPro; IPR000152; Asx_hydroxyl.

DR InterPro; IPR001881; EGF_Ca.

DR InterPro; IPR006209; EGF_like.

DR Pfam; PF01821; ANATO; 2.

DR Pfam; PF00008; EGF; 7.

DR SMART; SM00104; ANATO; 3.

DR SMART; SM00179; EGF_CA; 9.

DR PROSITE; PS00010; ASX_HYDROXYL; 5.

DR PROSITE; PS01177; ANAPHYLATOXIN_1; 3.

DR PROSITE; PS01178; ANAPHYLATOXIN_2; 3.

DR PROSITE; PS00022; EGF_1; FALSE_NEG.

DR PROSITE; PS01186; EGF_2; 5.

DR PROSITE; PS01187; EGF_CA; 9.

KW Signal; Glycoprotein; Extracellular matrix; Plasma; EGF-like domain;

KW Calcium-binding; Repeat.


FT   SIGNAL        1     27       POTENTIAL.
FT   CHAIN        28   1184       FIBULIN-2.
FT   DOMAIN       28    444       N.
FT   DOMAIN       28    177       SUBDOMAIN NA (CYS-RICH).
FT   DOMAIN      178    444       SUBDOMAIN NB (CYS-FREE).
FT   DOMAIN      445    480       ANAPHYLATOXIN-LIKE 1.
FT   DOMAIN      488    519       ANAPHYLATOXIN-LIKE 2.
FT   DOMAIN      521    553       ANAPHYLATOXIN-LIKE 3.
FT   DOMAIN      604    645       EGF-LIKE 1, CALCIUM-BINDING.
FT   DOMAIN      679    718       EGF-LIKE 2.
FT   DOMAIN      719    763       EGF-LIKE 3, CALCIUM-BINDING.
FT   DOMAIN      764    809       EGF-LIKE 4, CALCIUM-BINDING.
FT   DOMAIN      810    857       EGF-LIKE 5, CALCIUM-BINDING.
FT   DOMAIN      858    900       EGF-LIKE 6, CALCIUM-BINDING.
FT   DOMAIN      901    942       EGF-LIKE 7, CALCIUM-BINDING.
FT   DOMAIN      943    981       EGF-LIKE 8, CALCIUM-BINDING.
FT   DOMAIN      982   1024       EGF-LIKE 9, CALCIUM-BINDING.
FT   DOMAIN     1025   1069       EGF-LIKE 10, CALCIUM-BINDING.
FT   DOMAIN     1070   1184       DOMAIN III.
FT   DISULFID    445    472       BY SIMILARITY.
FT   DISULFID    446    479       BY SIMILARITY.
FT   DISULFID    459    480       BY SIMILARITY.
FT   DISULFID    489    518       BY SIMILARITY.
FT   DISULFID    502    519       BY SIMILARITY.
FT   DISULFID    521    545       BY SIMILARITY.
FT   DISULFID    522    552       BY SIMILARITY.
FT   DISULFID    535    553       BY SIMILARITY.
FT   DISULFID    608    620       BY SIMILARITY.
FT   DISULFID    616    629       BY SIMILARITY.
FT   DISULFID    631    644       BY SIMILARITY.
FT   DISULFID    683    693       BY SIMILARITY.
FT   DISULFID    689    702       BY SIMILARITY.
FT   DISULFID    704    717       BY SIMILARITY.
FT   DISULFID    723    736       BY SIMILARITY.
FT   DISULFID    730    745       BY SIMILARITY.
FT   DISULFID    751    762       BY SIMILARITY.
FT   DISULFID    768    781       BY SIMILARITY.
FT   DISULFID    775    790       BY SIMILARITY.
FT   DISULFID    796    808       BY SIMILARITY.
FT   DISULFID    814    827       BY SIMILARITY.
FT   DISULFID    821    836       BY SIMILARITY.
FT   DISULFID    843    856       BY SIMILARITY.
FT   DISULFID    862    875       BY SIMILARITY.
FT   DISULFID    869    884       BY SIMILARITY.
FT   DISULFID    886    899       BY SIMILARITY.
FT   DISULFID    905    917       BY SIMILARITY.
FT   DISULFID    913    926       BY SIMILARITY.
FT   DISULFID    928    941       BY SIMILARITY.
FT   DISULFID    947    956       BY SIMILARITY.
FT   DISULFID    952    965       BY SIMILARITY.
FT   DISULFID    967    980       BY SIMILARITY.
FT   DISULFID    986    998       BY SIMILARITY.
FT   DISULFID    994   1007       BY SIMILARITY.
FT   DISULFID   1009   1023       BY SIMILARITY.
FT   DISULFID   1029   1042       BY SIMILARITY.
FT   DISULFID   1036   1051       BY SIMILARITY.
FT   DISULFID   1056   1068       BY SIMILARITY.
FT   CARBOHYD    180    180       N-LINKED (GLCNAC...) (POTENTIAL).
FT   CARBOHYD    507    507       N-LINKED (GLCNAC...) (POTENTIAL).
FT   CARBOHYD   1035   1035       N-LINKED (GLCNAC...) (POTENTIAL).


SQ SEQUENCE 1184 AA; 126543 MW; CA48490A55F9EC5D CRC64;

Query Match 28.0%; Score 709.5; DB 1; Length 1184;

Best Local Similarity 34.3%; Pred. No. 3.7e-45;

Matches 146; Conservative 61; Mismatches 156; Indels 63; Gaps 12;



Qy 15 LCLPSPGN----AQAQCTNGFDLDRQSGQCLDIDECRTIPEACRGDMMCVNQNGGYLCIP 70

if = h h =1 :|| | | |:||:|| :: | || |:| | | | 

Db 780 LCQNTKGSFYCQARQRCMDGF-LQDPEGNCVDINECTSLSEPCRPGFSCINTVGSYTC-- 836 

Qy 71 RTNPVYRGPYSNPYSTPYSGPYPAAAPPLSAPNYPTISRPLICRFGYQ-MDESNQCVDVD 129

MM I I h Mllh 
Db 837 QRNPLI CARGYHASDDGAKCVDVN 860 

Qy 130 ECATDSHQCNPTQICINTEGGYTCSCTDGYW--LLEGQCLDIDECRYG---YCQQLCANV 184

M I I - I hi i I I I I I ; ! M I I 

Db 861 ECETGVHRCGEGQVCHNLPGSYRCDCKAGFQRDAFGRGCIDVNECWASPGRLCQHTCENT 920 

Qy 185 PGSYSCTCNPGFTLNEDGRSCQDVNECATENPCVQTCVNTYGSFICRCDPGYELEEDGVH 244 

Ml hi II I lh I : I I I I i I I I MM Ml 

Db 921 LGSYRCSCASGFLLAADGKRCEDVNECEAQR-CSQECANIYGSYQCYCRQGYQLAEDGHT 979 

Qy 245 CSDMDECS-FSEFLCQHECVNQPGTYFCSCP-PGYILLDDNRSCQDINECEHRNHTCNLQ 302

h h M h = II hi MM IMI II = = MhhMI I h 

Db 980 CTDIDECAQGAGILCTFRCLNVPGSYQCACPEQGYTMTANGRSCKDVDECALGTHNCSEA 1039

Qy 303 QTCYNLQGGFKCIDPIRCEEPYLRISDNRCMCPAENPGCRD QPFTI LYRDMDW 356 

Mhhll hh I h-l : I IN | h :: 

Db 1040 ETCHNIQGSFRCL-RFECPPNYVQVSKTKC ERTTCHDFLECQNSPARITHYQLNFQ 1094

Qy 357 SGRSVPADIFQMQATTRYPGAYYIFQIKSGNEGREFYMRQTGPISATLVMTRPIKGPREI 416

M III lh: : | | Ml | h = : : | : ||: 

Db 1095 TGLLVPAHIFRIGPAPAFTGDTIALNIIKGNEEGYFGTRRLNAYTGVVYLQRAVLEPRDF 1154

Qy 417 QLDLEM 422 

Ihll 

Db 1155 ALDVEM 1160 



RESULT 13 
FBL1_HUMAN 

ID FBL1_HUMAN STANDARD; PRT; 703 AA. 

AC P23142; P23143; P23144; P37888; Q8TBH8; Q9HBQ5; Q9UGR4; Q9UH41;

DT 01-NOV-1991 (Rel. 20, Created)

DT 15-SEP-2003 (Rel. 42, Last sequence update)

DT 15-SEP-2003 (Rel. 42, Last annotation update)

DE Fibulin-1 precursor.

GN FBLN1.

OS Homo sapiens (Human).

OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; 
OC Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. 



OX NCBI_TaxID=9606;

RN [1]

RP SEQUENCE FROM N.A. (ISOFORMS A, B AND C).

RX MEDLINE=91100426; PubMed=2269669;

RA Argraves W.S., Tran H., Burgess W.H., Dickerson K.;

RT "Fibulin is an extracellular matrix and plasma glycoprotein with

RT repeated domain structure.";

RL J. Cell Biol. 111:3155-3164(1990).

RN [2]

RP SEQUENCE FROM N.A. (ISOFORM D), TISSUE SPECIFICITY, AND INTERACTION

RP WITH FN1 AND FGB.

RX PubMed=9106159;

RA Tran H., Mattei M., Godyna S., Argraves W.S.;

RT "Human fibulin-1D: molecular cloning, expression and similarity with

RT S1-5 protein, a new member of the fibulin gene family.";

RL Matrix Biol. 15:479-493(1997).

RN [3] 

RP SEQUENCE FROM N.A. (ISOFORM D).

RX MEDLINE=99253993; PubMed=10318851;

RA Krichevsky A.M., Metzer E., Rosen H.;

RT "Translational control of specific genes during differentiation of

RT HL-60 cells.";

RL J. Biol. Chem. 274:14295-14305(1999).

RN [4]

RP SEQUENCE FROM N.A. (ISOFORM C).

RA Gu J.R., Wan D.F., Zhao X.T., Zhou X.M., Jiang H.Q., Zhang P.P.,

RA Qin W.X., Huang Y., Qiu X.K., Qian L.F., He L.P., Li H.N., Yu Y.,

RA Yu J., Han L.H.;

RT "Novel Human cDNA clones with function of inhibiting cancer cell

RT growth.";

RL Submitted (DEC-1999) to the EMBL/GenBank/DDBJ databases.

RN [5]

RP SEQUENCE FROM N.A.

RX MEDLINE=20057165; PubMed=10591208;

RA Dunham I., Hunt A.R., Collins J.E., Bruskiewich R., Beare D.M.,

RA Clamp M., Smink L.J., Ainscough R., Almeida J.P., Babbage A.K.,

RA Bagguley C., Bailey J., Barlow K.F., Bates K.N., Beasley O.P.,

RA Bird C.P., Blakey S.E., Bridgeman A.M., Buck D., Burgess J.,

RA Burrill W.D., Burton J., Carder C., Carter N.P., Chen Y., Clark G.,

RA Clegg S.M., Cobley V.E., Cole C.G., Collier R.E., Connor R.,

RA Conroy D., Corby N.R., Coville G.J., Cox A.V., Davis J., Dawson E.,

RA Dhami P.D., Dockree C., Dodsworth S.J., Durbin R.M., Ellington A.G.,

RA Evans K.L., Fey J.M., Fleming K., French L., Garner A.A.,

RA Gilbert J.G.R., Goward M.E., Grafham D.V., Griffiths M.N.D., Hall C.,

RA Hall R.E., Hall-Tamlyn G., Heathcott R.W., Ho S., Holmes S.,

RA Hunt S.E., Jones M.C., Kershaw J., Kimberley A.M., King A.,

RA Laird G.K., Langford C.F., Leversha M.A., Lloyd C., Lloyd D.M.,

RA Martyn I.D., Mashreghi-Mohammadi M., Matthews L.H., Mccann O.T.,

RA Mcclay J., Mclaren S., McMurray A.A., Milne S.A., Mortimore B.J.,

RA Odell C.N., Pavitt R., Pearce A.V., Pearson D., Phillimore B.J.C.T.,

RA Phillips S.H., Plumb R.W., Ramsay H., Ramsey Y., Rogers L., Ross M.T.,

RA Scott C.E., Sehra H.K., Skuce C.D., Smalley S., Smith M.L.,

RA Soderlund C., Spragon L., Steward C.A., Sulston J.E., Swann R.M.,

RA Vaudin M., Wall M., Wallis J.M., Whiteley M.N., Willey D.L.,

RA Williams L., Williams S.A., Williamson H., Wilmer T.E., Wilming L.,

RA Wright C.L., Hubbard T., Bentley D.R., Beck S., Rogers J., Shimizu N.,

RA Minoshima S., Kawasaki K., Sasaki T., Asakawa S., Kudoh J.,

RA Shintani A., Shibuya K., Yoshizaki Y., Aoki N., Mitsuyama S.,

RA Roe B.A., Chen P., Chu L., Crabtree J., Deschamps S., Do A., Do T.,

RA Dorman A., Fang F., Fu Y., Hu P., Hua A., Kenton S., Lai H., Lao H.I.,

RA Lewis J., Lewis S., Lin S.-P., Loh P., Malaj B., Nguyen T., Pan H.,

RA Phan S., Qi S., Qian Y., Ray L., Ren Q., Shaull S., Sloan D., Song L.,

RA Wang Q., Wang Y., Wang Z., White J., Willingham D., Wu H., Yao Z.,

RA Zhan M., Zhang G., Chissoe S., Murray J., Miller N., Minx P.,

RA Fulton R., Johnson D., Bemis G., Bentley D., Bradshaw H., Bourne S.,

RA Cordes M., Du Z., Fulton L., Goela D., Graves T., Hawkins J.,

RA Hinds K., Kemp K., Latreille P., Layman D., Ozersky P., Rohlfing T.,

RA Scheet P., Walker C., Wamsley A., Wohldmann P., Pepin K., Nelson J.,

RA Korf I., Bedell J.A., Hillier L., Mardis E., Waterston R., Wilson R.,

RA Emanuel B.S., Shaikh T., Kurahashi H., Saitta S., Budarf M.L.,

RA McDermid H.E., Johnson A., Wong A.C.C., Morrow B.E., Edelmann L.,

RA Kim U.J., Shizuya H., Simon M.I., Dumanski J.P., Peyrard M., Kedra D.,

RA Seroussi E., Fransson I., Tapia I., Bruder C.E., O'Brien K.P.,

RA Wilkinson P., Bodenteich A., Hartman K., Hu X., Khan A.S., Lane L.,

RA Tilahun Y., Wright H.;

RT "The DNA sequence of human chromosome 22.";

RL Nature 402:489-495(1999).

RN [6] 

RP SEQUENCE FROM N.A. (ISOFORM C).

RC TISSUE=Brain;

RX MEDLINE=22388257; PubMed=12477932;

RA Strausberg R.L., Feingold E.A., Grouse L.H., Derge J.G.,

RA Klausner R.D., Collins F.S., Wagner L., Shenmen C.M., Schuler G.D.,

RA Altschul S.F., Zeeberg B., Buetow K.H., Schaefer C.F., Bhat N.K.,

RA Hopkins R.F., Jordan H., Moore T., Max S.I., Wang J., Hsieh F.,

RA Diatchenko L., Marusina K., Farmer A.A., Rubin G.M., Hong L.,

RA Stapleton M., Soares M.B., Bonaldo M.F., Casavant T.L., Scheetz T.E.,

RA Brownstein M.J., Usdin T.B., Toshiyuki S., Carninci P., Prange C.,

RA Raha S.S., Loquellano N.A., Peters G.J., Abramson R.D., Mullahy S.J.,

RA Bosak S.A., McEwan P.J., McKernan K.J., Malek J.A., Gunaratne P.H.,

RA Richards S., Worley K.C., Hale S., Garcia A.M., Gay L.J., Hulyk S.W.,

RA Villalon D.K., Muzny D.M., Sodergren E.J., Lu X., Gibbs R.A.,

RA Fahey J., Helton E., Ketteman M., Madan A., Rodrigues S., Sanchez A.,

RA Whiting M., Madan A., Young A.C., Shevchenko Y., Bouffard G.G.,

RA Blakesley R.W., Touchman J.W., Green E.D., Dickson M.C.,

RA Rodriguez A.C., Grimwood J., Schmutz J., Myers R.M.,

RA Butterfield Y.S.N., Krzywinski M.I., Skalska U., Smailus D.E.,

RA Schnerch A., Schein J.E., Jones S.J.M., Marra M.A.;

RT "Generation and initial analysis of more than 15,000 full-length human 

RT and mouse cDNA sequences."; 

RL Proc. Natl. Acad. Sci. U.S.A. 99:16899-16903(2002). 

RN [7] 

RP SEQUENCE OF 1-26 FROM N.A. 

RX PubMed=11829738;

RA Castoldi M., Chu M.-L.;

RT "Structural and functional characterization of the human and mouse

RT fibulin-1 gene promoters: role of Sp1 and Sp3.";

RL Biochem. J. 362:41-50(2002).

RN [8]

RP SEQUENCE OF 30-44.

RX MEDLINE=89354537; PubMed=2527614;

RA Argraves W.S., Dickerson K., Burgess W.H., Ruoslahti E.;

RT "Fibulin, a novel protein that interacts with the fibronectin

RT receptor beta subunit cytoplasmic domain.";

RL Cell 58:623-629(1989).

RN [9]

RP SELF-ASSOCIATION AND INTERACTION WITH FN1.

RX PubMed=1400330;

RA Balbona K., Tran H., Godyna S., Ingham K.C., Strickland D.K.,

RA Argraves W.S.;

RT "Fibulin binds to itself and to the carboxyl-terminal heparin-binding

RT region of fibronectin.";

RL J. Biol. Chem. 267:20120-20125(1992). 

RN [10] 

RP POSSIBLE FUNCTION. 

RX PubMed=7534784; 

RA Roark E.F., Keene D.R., Haudenschild C.C., Godyna S., Little C.D.,

RA Argraves W.S.;

RT "The association of human fibulin-1 with elastic fibers: an

RT immunohistological, ultrastructural, and RNA study.";

RL J. Histochem. Cytochem. 43:401-411(1995).

RN [11]

RP INTERACTION WITH FGB.

RX PubMed=7642629;

RA Tran H., Tanaka A., Litvinovich S.V., Medved L.V., Haudenschild C.C.,

RA Argraves W.S.;

RT "The interaction of fibulin-1 with fibrinogen. A potential role in

RT hemostasis and thrombosis.";

RL J. Biol. Chem. 270:19458-19464(1995).

RN [12]

RP DEVELOPMENTAL STAGE.

RX PubMed=8737292;

RA Miosge N., Gotz W., Sasaki T., Chu M.-L., Timpl R., Herken R.;

RT "The extracellular matrix proteins fibulin-1 and fibulin-2 in the

RT early human embryo.";

RL Histochem. J. 28:109-116(1996). 

RN [13] 

RP INDUCTION. 

RX MEDLINE=96133928; PubMed=8552629;

RA Clinton G.M., Rougeot C., Derancourt J., Roger P., Defrenne A.,

RA Godyna S., Argraves W.S., Rochefort H.;

RT "Estrogens increase the expression of fibulin-1, an extracellular

RT matrix protein secreted by human ovarian cancer cells.";

RL Proc. Natl. Acad. Sci. U.S.A. 93:316-320(1996).

RN [14]

RP CALCIUM, SELF-ASSOCIATION, AND FN1-BINDING SITES.

RX PubMed=9278415;

RA Tran H., VanDusen W.J., Argraves W.S.;

RT "The self-association and fibronectin-binding sites of fibulin-1 map

RT to calcium-binding epidermal growth factor-like domains.";

RL J. Biol. Chem. 272:22600-22606(1997).

RN [15]

RP ROLE IN TUMOR FORMATION AND INVASION.

RX PubMed=9393974;

RA Qing J., Maher V.M., Tran H., Argraves W.S., Dunstan R.W.,

RA McCormick J.J.;

RT "Suppression of anchorage-independent growth and matrigel invasion and

RT delayed tumor formation by elevated expression of fibulin-1D in human

RT fibrosarcoma-derived cell lines.";

RL Oncogene 15:2159-2168(1997). 

RN [16] 



RP INDUCTION. 

RX PubMed=9811350;

RA Roger P., Pujol P., Lucas A., Baldet P., Rochefort H.;

RT "Increased immunostaining of fibulin-1, an estrogen-regulated protein

RT in the stroma of human ovarian epithelial tumors.";

RL Am. J. Pathol. 153:1579-1588(1998).

RN [17]

RP ROLE IN TUMOR FORMATION AND INVASION.

RX PubMed=9466671;

RA Hayashido Y., Lucas A., Rougeot C., Godyna S., Argraves W.S.,

RA Rochefort H.;

RT "Estradiol and fibulin-1 inhibit motility of human ovarian- and

RT breast-cancer cells induced by fibronectin.";

RL Int. J. Cancer 75:654-658(1998).

RN [18]

RP INTERACTION WITH NOV.

RX PubMed=9927660;

RA Perbal B., Martinerie C., Sainson R., Werner M., He B., Roizman B.;

RT "The C-terminal domain of the regulatory protein NOVH is sufficient to

Query Match 27.7%; Score 701.5; DB 1; Length 703; 

Best Local Similarity 34.4%; Pred. No. 8.6e-45; 

Matches 162; Conservative 66; Mismatches 150; Indels 93; Gaps 22; 

Qy 16 CLPSPGNAQAQCTNGFDLDRQSGQCLDIDECRTIPEACRGDMMCVNQNGGYLCIPRTNPV 75

II = II =11 I | |:||:|[ :| | |:| | | | 
Db 288 CRP KLQCKSGF I QD - ALGNCI DI NECLS I SAPCPI GHTCI NTEGS YTC 334 

Qy 76 YRGPYSNPYSTPYSGPYPAAAPPLSAPNYPTISRPLICRPGYQM-DESNQCVDVDECATD 134

Db : " I I ! : : I : I I I I I I I I 

Db 335 QKNVPN CGRGYHLNEEGTRCVDVDECAPP 363

Qy 135 SHQCNPTQICINTEGGYTCSCTDGYWL--LEGQCLDIDEC-RY--GYCQQLCANVPGSYS 189

= I 1 = 1= I : I I ||: : |:|::|| || | | | j|| 

Db 364 AEPCGKGHRCVNSPGSFRCECKTGYYFDGISRMCVDVNECQRYPGRLCGHKCENTLGSYL 423 

Qy 190 CTCNPGFTLNEDGRSCQDVNECATENPCVQTCVNTYGSFICRCDPGYELEE-DGVHCSDM 248

1=1= II h llllhhlll:: :|| I I I |||: | | ||:| : ||| | |: 
Db 424 CSCSVGFRLSVDGRSCEDINECSS-SPCSQECANVYGSYQCYCRRGYQLSDVDGVTCEDI 482 

Qy 249 DECSF--SEFLCQHECVNQPGTYFCSCP-PGYILLDDNRSCQDINECEHRNHTCNLQQTC 305

III s =1 = hi M- I I I I II I : hlllhll I h= =11 

Db 483 DECALPTGGHICSYRCINIPGSFQCSCPSSGYRLAPNGRNCQDIDECVTGIHNCSINETC 542

Qy 306 YNLQGGFKCIDPIRCEEPYLRISDN RCMCPAENPGCRDQPFTILYRDMDW 356

= hlll|:|: ||||: [|: || | :: : : 

Db 543 FN I QGGFRCL - AFECPEN YRRSAATLQQEKTDT VRC I KSCRPNDVTCVFDPVHTI 596 

Qy 357 SGRSV PADIFQMQA-TTRYPG--AYYIFQIKSGNEGREFYMRQTG 398

I : 1=1 : : I I -I I IN || | HI 

Db 597 SHTVISLPTFREFTRPEEIIFLRAITPPHPASQANIIFDITEGNLRDSFDIIKRYM--DG 654 

Qy 399 PISATLVMTRPIKGPREIQLDLEM-ITVNTVINFRGSSVIRLRIYVSQYPF 448

: HI M I Ml I h= I =h :|hlh| | 

Db 655 MTVGVVRQVRPIVGPFHAVLKLEMNYVVGGVVSHR--NVVNVRIFVSEYWF 703



RESULT 14 
FBL1_MOUSE

ID FBL1_MOUSE STANDARD; PRT; 705 AA.

AC Q08879; Q08878; Q8C3B1; Q91ZC9; Q922K8; 

DT 01-OCT-1994 (Rel. 30, Created) 

DT 15-SEP-2003 (Rel. 42, Last sequence update) 

DT 15-SEP-2003 (Rel. 42, Last annotation update) 

DE Fibulin-1 precursor (Basement-membrane protein 90) (BM-90).

GN FBLN1.

OS Mus musculus (Mouse).

OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;

OC Mammalia; Eutheria; Rodentia; Sciurognathi; Muridae; Murinae; Mus.

OX NCBI_TaxID=10090; 
RN [1] 

RP SEQUENCE FROM N.A. (ISOFORMS C AND D), AND LIGANDS INTERACTION.

RX MEDLINE=93358897; PubMed=8354280;

RA Pan T.-C., Kluge M., Zhang R.Z., Mayer U., Timpl R., Chu M.-L.;

RT "Sequence of extracellular mouse protein BM-90/fibulin and its

RT calcium-dependent binding to other basement-membrane ligands.";

RL Eur. J. Biochem. 215:733-740(1993).

RN [2]

RP SEQUENCE FROM N.A. (ISOFORM C).

RC STRAIN=C57BL/6J; TISSUE=Head, and Urinary bladder;

RX MEDLINE=22354683; PubMed=12466851;

RA Okazaki Y., Furuno M., Kasukawa T., Adachi J., Bono H., Kondo S.,

RA Nikaido I., Osato N., Saito R., Suzuki H., Yamanaka I., Kiyosawa H.,

RA Yagi K., Tomaru Y., Hasegawa Y., Nogami A., Schonbach C., Gojobori T.,

RA Baldarelli R., Hill D.P., Bult C., Hume D.A., Quackenbush J.,

RA Schriml L.M., Kanapin A., Matsuda H., Batalov S., Beisel K.W.,

RA Blake J.A., Bradt D., Brusic V., Chothia C., Corbani L.E., Cousins S.,

RA Dalla E., Dragani T.A., Fletcher C.F., Forrest A., Frazer K.S.,

RA Gaasterland T., Gariboldi M., Gissi C., Godzik A., Gough J.,

RA Grimmond S., Gustincich S., Hirokawa N., Jackson I.J., Jarvis E.D.,

RA Kanai A., Kawaji H., Kawasawa Y., Kedzierski R.M., King B.L.,

RA Konagaya A., Kurochkin I.V., Lee Y., Lenhard B., Lyons P.A.,

RA Maglott D.R., Maltais L., Marchionni L., McKenzie L., Miki H.,

RA Nagashima T., Numata K., Okido T., Pavan W.J., Pertea G., Pesole G.,

RA Petrovsky N., Pillai R., Pontius J.U., Qi D., Ramachandran S.,

RA Ravasi T., Reed J.C., Reed D.J., Reid J., Ring B.Z., Ringwald M.,

RA Sandelin A., Schneider C., Semple C.A., Setou M., Shimada K.,

RA Sultana R., Takenaka Y., Taylor M.S., Teasdale R.D., Tomita M.,

RA Verardo R., Wagner L., Wahlestedt C., Wang Y., Watanabe Y., Wells C.,

RA Wilming L.G., Wynshaw-Boris A., Yanagisawa M., Yang I., Yang L.,

RA Yuan Z., Zavolan M., Zhu Y., Zimmer A., Carninci P., Hayatsu N.,

RA Hirozane-Kishikawa T., Konno H., Nakamura M., Sakazume N., Sato K.,

RA Shiraki T., Waki K., Kawai J., Aizawa K., Arakawa T., Fukuda S.,

RA Hara A., Hashizume W., Imotani K., Ishii Y., Itoh M., Kagawa I.,

RA Miyazaki A., Sakai K., Sasaki D., Shibata K., Shinagawa A.,

RA Yasunishi A., Yoshino M., Waterston R., Lander E.S., Rogers J.,

RA Birney E., Hayashizaki Y.;

RT "Analysis of the mouse transcriptome based on functional annotation of

RT 60,770 full-length cDNAs.";

RL Nature 420:563-573(2002).

RN [3] 

RP SEQUENCE FROM N.A. (ISOFORM C).

RC TISSUE=Breast tumor;

RX MEDLINE=22388257; PubMed=12477932;

RA Strausberg R.L., Feingold E.A., Grouse L.H., Derge J.G.,

RA Klausner R.D., Collins F.S., Wagner L., Shenmen C.M., Schuler G.D.,

RA Altschul S.F., Zeeberg B., Buetow K.H., Schaefer C.F., Bhat N.K.,

RA Hopkins R.F., Jordan H., Moore T., Max S.I., Wang J., Hsieh F.,

RA Diatchenko L., Marusina K., Farmer A.A., Rubin G.M., Hong L.,

RA Stapleton M., Soares M.B., Bonaldo M.F., Casavant T.L., Scheetz T.E.,

RA Brownstein M.J., Usdin T.B., Toshiyuki S., Carninci P., Prange C.,

RA Raha S.S., Loquellano N.A., Peters G.J., Abramson R.D., Mullahy S.J.,

RA Bosak S.A., McEwan P.J., McKernan K.J., Malek J.A., Gunaratne P.H.,

RA Richards S., Worley K.C., Hale S., Garcia A.M., Gay L.J., Hulyk S.W.,

RA Villalon D.K., Muzny D.M., Sodergren E.J., Lu X., Gibbs R.A.,

RA Fahey J., Helton E., Ketteman M., Madan A., Rodrigues S., Sanchez A.,

RA Whiting M., Madan A., Young A.C., Shevchenko Y., Bouffard G.G.,

RA Blakesley R.W., Touchman J.W., Green E.D., Dickson M.C.,

RA Rodriguez A.C., Grimwood J., Schmutz J., Myers R.M.,

RA Butterfield Y.S.N., Krzywinski M.I., Skalska U., Smailus D.E.,

RA Schnerch A., Schein J.E., Jones S.J.M., Marra M.A.;

RT "Generation and initial analysis of more than 15,000 full-length human

RT and mouse cDNA sequences.";

RL Proc. Natl. Acad. Sci. U.S.A. 99:16899-16903(2002).

RN [4] 

RP SEQUENCE OF 1-26 FROM N.A. 

RX PubMed=11829738;

RA Castoldi M., Chu M.-L.;

RT "Structural and functional characterization of the human and mouse

RT fibulin-1 gene promoters: role of Sp1 and Sp3.";

RL Biochem. J. 362:41-50(2002).

RN [5]

RP CHARACTERIZATION OF NID AFFINITY.

RX PubMed=7844816;

RA Sasaki T., Kostka G., Goehring W., Wiedemann H., Mann K., Chu M.-L.,

RA Timpl R.;

RT "Structural characterization of two variants of fibulin-1 that differ

RT in nidogen affinity.";

RL J. Mol. Biol. 245:241-250(1995).

RN [6]

RP DEVELOPMENTAL STAGE.

RX PubMed=8850569;

RA Zhang H.-Y., Timpl R., Sasaki T., Chu M.-L., Ekblom P.;

RT "Fibulin-1 and fibulin-2 expression during organogenesis in the

RT developing mouse embryo.";

RL Dev. Dyn. 205:348-364(1996).

RN [7]

RP NID-BINDING SITE.

RC STRAIN=129/Sv;

RX PubMed=9299350;

RA Adam S., Goehring W., Wiedemann H., Chu M.-L., Timpl R., Kostka G.;

RT "Binding of fibulin-1 to nidogen depends on its C-terminal globular

RT domain and a specific array of calcium-binding epidermal growth 

RT factor-like (EG) modules."; 

RL J. Mol. Biol. 272:226-236(1997). 

RN [8] 

RP BINDING TO LAMA2.

RX PubMed=10022829;

RA Talts J.F., Andac Z., Goehring W., Brancaccio A., Timpl R.;

RT "Binding of the G domains of laminin alpha1 and alpha2 chains and

RT perlecan to heparin, sulfatides, alpha-dystroglycan and several

RT extracellular matrix proteins.";

RL EMBO J. 18:863-870(1999).

RN [9]

RP INTERACTION WITH AGC1 AND CSPG2.

RX PubMed=10400671;

RA Aspberg A., Adam S., Kostka G., Timpl R., Heinegaard D.;

RT "Fibulin-1 is a ligand for the C-type lectin domains of aggrecan and

RT versican.";

RL J. Biol. Chem. 274:20444-20449(1999).

RN [10]

RP INTERACTION WITH NID.

RX PubMed=11589703;

RA Ries A., Goehring W., Fox J.W., Timpl R., Sasaki T.;

RT "Recombinant domains of mouse nidogen-1 and their binding to basement

RT membrane proteins and monoclonal antibodies.";

RL Eur. J. Biochem. 268:5119-5128(2001). 

RN [11] 

RP DOWN-REGULATION BY GLUCOCORTICOIDS. 

RX PubMed=11737251;

RA Gu Y.-C., Talts J.F., Gullberg D., Timpl R., Ekblom M.;

RT "Glucocorticoids down-regulate the extracellular matrix proteins 

RT fibronectin, fibulin-1 and fibulin-2 in bone marrow stroma."; 

RL Eur. J. Haematol. 67:176-184(2001). 

RN [12]

RP TISSUE SPECIFICITY, AND DEVELOPMENTAL STAGE.

RX PubMed=11238726;

RA Ohsawa I., Takamura C., Kohsaka S.;

RT "Fibulin-1 binds the amino-terminal head of beta-amyloid precursor

RT protein and modulates its physiological function."; 

RL J. Neurochem. 76:1411-1420(2001). 

RN [13] 

RP INTERACTION WITH E6, AND INHIBITION OF E6-MEDIATED TRANSFORMATION.

RX PubMed=12200142;

RA Du M., Fan X., Hong E., Chen J.J.;

RT "Interaction of oncogenic papillomavirus E6 proteins with fibulin-1.";

RL Biochem. Biophys. Res. Commun. 296:962-969(2002).

RN [14] 

RP DEVELOPMENTAL STAGE. 

RX PubMed=11836357; 

RA Debeer P., Schoenmakers E.F.P.M., Twal W.O., Argraves W.S.,

RA De Smet L., Fryns J.-P., Van De Ven W.J.M.;

RT "The fibulin-1 gene (FBLN1) is disrupted in a t(12;22) associated with

RT a complex type of synpolydactyly.";

RL J. Med. Genet. 39:98-104(2002). 

CC -!- FUNCTION: Incorporated into fibronectin-containing matrix fibers.

CC May play a role in cell adhesion and migration along protein

CC fibers within the extracellular matrix (ECM). Could be important

CC for certain developmental processes and contribute to the

CC supramolecular organization of ECM architecture, in particular to

CC those of basement membranes.

CC -!- SUBUNIT: Interacts with itself and with various extracellular
CC matrix components such as FN1, LAMA1, LAMA2, NID, AGC1, CSPG2 and

CC type IV collagen. Interacts also with papillomavirus E6 proteins.

CC Binding analysis demonstrated for isoform C a 100-fold stronger

CC binding to the basement membrane protein NID than for isoform D. 

CC -!- SUBCELLULAR LOCATION: Secreted; extracellular matrix. 

CC -!- ALTERNATIVE PRODUCTS: 



CC Event=Alternative splicing; Named isoforms=4;

CC Name=D;

CC IsoId=Q08879-1; Sequence=Displayed;

CC Name=A;

CC IsoId=Q08879-3; Sequence=Not described;

CC Name=B;

CC IsoId=Q08879-4; Sequence=Not described;

CC Name=C;

CC IsoId=Q08879-2; Sequence=VSP_001386;

CC Note=Conflict E -> A at position 571 of isoform C (Ref.1);

CC -!- TISSUE SPECIFICITY: Detected in most organs (brain, heart, lung,

CC spleen, liver and kidney). Neurons are the predominant source of

CC production in the brain. Not expressed significantly by astrocytes 

CC or microglia. 

CC -!- DEVELOPMENTAL STAGE: The differential expression of the fibulin 

CC family contributes to the formation of molecularly distinct 

CC extracellular matrices already during early developmental stages 

CC of a large number of tissues. Increased expression at neonate stage

CC in the brain. Expressed in interdigital regions of the handplate 

CC of a 12 dpc embryo and in the lateral perichondrial region. 

CC Similar expression persists in the 13 dpc handplate particularly 

CC in the perichondrial regions and apical aspects of the developing

CC digits. 

CC -!- INDUCTION: Glucocorticoids suppressed mRNA expression and protein 

CC synthesis. 

CC -!- SIMILARITY: Belongs to the fibulin family. 

CC -!- SIMILARITY: Contains 3 anaphylatoxin-like domains.

CC -!- SIMILARITY: Contains 9 EGF-like domains. 

CC 

CC This SWISS-PROT entry is copyright. It is produced through a collaboration 

CC between the Swiss Institute of Bioinformatics and the EMBL outstation - 

Query Match 27.7%; Score 700.5; DB 1; Length 705; 
Best Local Similarity 34.7%; Pred. No. 1e-44;

Matches 161; Conservative 68; Mismatches 156; Indels 79; Gaps 22; 

Qy 16 CLPSPGNAQAQCTNGFDLDRQSGQCLDIDECRTIPEACRGDMMCVNQNGGYLCIPRTNPV 75
II HI =11 | | |:||:|| :j | I 
Db 290 CRP KLQCKSGF I QD - ALGNC I D I NE CLS I SAP CPVGQTC I NTEGS YTC 336 

Qy 76 YRGPYSNPYSTPYSGPYPAAAPPLSAPNYPTISRPLICRFGYQM-DESNQCVDVDECATD 134

Dh «, 1 11 I II = =1 =1111111= 

Db 337 QKNVPN CGRGYHLNEEGTRCVDVDECSPP 365

Qy 135 SHQCNPTQ I C I NTEGGYTCS CTDGYWL - - LEGQCLD I DEC - RY - - GYCQQLCAN VPGS YS 189 

: I I : I = I : I I i : : = Nihil II I I I ||h 

Db 366 AEPCGKGHHCLNSPGSFRCECKAGFYFDGISRTCVDINECQRYPGRLCGHKCENTPGSFH 425

Qy 190 CTCNPGFTLNEDGRSCQDVNECATENPCVQTCVNTYGSFICRCDPGYELEE-DGVHCSDM 248 

hh M h II II h llll I =11 I I I llh I I || = | = III | | x 

Db 426 CSCSAGFRLSVDGRSCEDVNEC-LNSPCSQECANVYGSYQCYCRRGYQLSDVDGVTCEDI 484 

Qy 249 DECSF- - SEFLCQHECVNQPGTYFCSCP- PGYILLDDNRSCQDINECEHRNHTCNLQQTC 305 

Db 485 DE CALPTGGHI CS YRC I N I PGS FQ CS CPS SG YRLAPNGRNCQD I DECVTG I HNCS I NETC 544 

Qy 306 YNLQGGFKCIDPI RCEEPYLRI SDN RCM- -CPAENPGC-RDQPFTILYRDM 353 



: I : I I I : I • I I I I : ! II- I ■ III I • • : 

Db 545 FNI QGSPRCLS -FECPENYRRSADTFRQEKTDTVRCI KSCRPNDEACVRDPVHTVSHTVI 603 

Qy 354 D WSGRS V - - PAD I FQMQATT - R Y PG - -AYYIFQIKSGNEGREFYM- - -RQTGPISATLV 405 

= : I I H = = l I II I II I II I = =1 

Db 604 SLPTFREFTRPEEI I FLRAVTPLYPANQADI I FDI TEGNLRDSFDI I KRYEDGMTVGVVR 663 

Qy 406 MTRPIKGPREIQLDLEM-ITVNTVINFRGSSVIRLRIYVSQYPF 448

III [I I III = I- I :h : 1 = 11 = 1 I 

Db 664 QVRPIVGPFYAVLKLEMNYVLGGWSHR--NWNVHIFVSEYWF 705 

RESULT 15 
FBL1_BRARE 

ID FBL1_BRARE STANDARD; PRT; 681 AA. 

AC O42182; O42183;

DT 15-SEP-2003 (Rel. 42, Created)

DT 15-SEP-2003 (Rel. 42, Last sequence update) 

DT 15-SEP-2003 (Rel. 42, Last annotation update) 

DE Fibulin-1 precursor. 

GN FBLN1.

OS Brachydanio rerio (Zebrafish) (Danio rerio).

OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;

OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; Cypriniformes;

OC Cyprinidae; Danio. 

OX NCBI_TaxID=7955; 

RN [1] 

RP SEQUENCE FROM N.A. (ISOFORMS C AND D).

RA Zhang H.-Y., Lardelli M., Ekblom P.;

RT "Sequence of zebrafish fibulin-1 and its expression in developing

RT heart and other embryonic organs.";

RL Dev. Genes Evol. 207:340-351(1997).

CC -!- FUNCTION: Incorporated into fibronectin-containing matrix fibers.

CC May play a role in cell adhesion and migration along protein 

CC fibers within the extracellular matrix (ECM). Could be important

CC for certain developmental processes and contribute to the

CC supramolecular organization of ECM architecture, in particular to

CC those of basement membranes.

CC -!- SUBUNIT: Interacts with itself and with various extracellular
CC matrix components such as FN1, LAMM, NID, AGC1 and CSPG2.

CC -!- SUBCELLULAR LOCATION: Secreted; extracellular matrix. 
CC -!- ALTERNATIVE PRODUCTS:

CC Event=Alternative splicing; Named isoforms=2;

CC Name=D;

CC IsoId=O42182-1; Sequence=Displayed;

CC Name=C;

CC IsoId=O42182-2; Sequence=VSP_007379;

CC -!- DEVELOPMENTAL STAGE: Isoform C is detected in the later blastula

CC period, 4 h after fertilization. Isoform D is not detected at this 

CC stage, it first appears during the gastrula period in 8-h-old 

CC embryos. Expression of both isoforms is then maintained throughout 

CC development. During later developmental stages, prominent 

CC expression is seen in regions where tissue compartments are 

CC continuously moving in relation to each other. 

CC -!- SIMILARITY: Belongs to the fibulin family.

CC -!- SIMILARITY: Contains 3 anaphylatoxin-like domains.

CC -!- SIMILARITY: Contains 9 EGF-like domains.
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1; 
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EMBL; AF013752; AAB80945 


l; 






DR HSSP; P35555; 1EMN.
DR ZFIN; ZDB-GENE-990415-73; fbln1.
DR InterPro; IPR000020; Anaphylatoxin.
DR InterPro; IPR000152; Asx_hydroxyl.
DR InterPro; IPR001881; EGF_Ca.
DR InterPro; IPR006209; EGF_like.
DR InterPro; IPR001673; S_mold_repeat.
DR Pfam; PF01821; ANATO; 1.
DR Pfam; PF00008; EGF; 4.
DR SMART; SM00104; ANATO; 2.
DR SMART; SM00179; EGF_CA; 5.
DR PROSITE; PS01177; ANAPHYLATOXIN_1; 1.
DR PROSITE; PS01178; ANAPHYLATOXIN_2; 1.
DR PROSITE; PS00010; ASX_HYDROXYL; 3.
DR PROSITE; PS01186; EGF_2; 3.
DR PROSITE; PS01187; EGF_CA; 6.
KW Signal; Alternative splicing; Glycoprotein; Extracellular matrix;
KW Repeat; EGF-like domain; Calcium-binding.
FT SIGNAL        1     17       POTENTIAL.
FT CHAIN        18    681       FIBULIN-1.
FT DOMAIN       29     63       ANAPHYLATOXIN-LIKE 1.
FT DOMAIN       68    107       ANAPHYLATOXIN-LIKE 2.
FT DOMAIN      108    139       ANAPHYLATOXIN-LIKE 3.
FT DOMAIN      158    192       EGF-LIKE 1.
FT DOMAIN      193    238       EGF-LIKE 2, CALCIUM-BINDING (POTENTIAL).
FT DOMAIN      239    284       EGF-LIKE 3, CALCIUM-BINDING (POTENTIAL).
FT DOMAIN      285    331       EGF-LIKE 4, CALCIUM-BINDING (POTENTIAL).
FT DOMAIN      332    373       EGF-LIKE 5, CALCIUM-BINDING (POTENTIAL).
FT DOMAIN      374    415       EGF-LIKE 6, CALCIUM-BINDING (POTENTIAL).
FT DOMAIN      416    455       EGF-LIKE 7, CALCIUM-BINDING (POTENTIAL).
FT DOMAIN      456    499       EGF-LIKE 8, CALCIUM-BINDING (POTENTIAL).
FT DOMAIN      500    554       EGF-LIKE 9, CALCIUM-BINDING (POTENTIAL).
FT DISULFID     29     55       BY SIMILARITY.
FT DISULFID     30     62       BY SIMILARITY.
FT DISULFID     43     63       BY SIMILARITY.
FT DISULFID     72    103       BY SIMILARITY.
FT DISULFID     85    104       BY SIMILARITY.
FT DISULFID    106    125       BY SIMILARITY.
FT DISULFID    107    138       BY SIMILARITY.
FT DISULFID    114    139       BY SIMILARITY.
FT DISULFID    162    171       BY SIMILARITY.
FT DISULFID    167    176       BY SIMILARITY.
FT DISULFID    178    191       BY SIMILARITY.
FT DISULFID    197    210       BY SIMILARITY.
FT DISULFID    204    219       BY SIMILARITY.
FT DISULFID    225    237       BY SIMILARITY.
FT DISULFID    243    256       BY SIMILARITY.
FT DISULFID    250    265       BY SIMILARITY.
FT DISULFID    271    283       BY SIMILARITY.
FT DISULFID    289    301       BY SIMILARITY.
FT DISULFID    317    330       BY SIMILARITY.
FT DISULFID    336    348       BY SIMILARITY.
FT DISULFID    343    357       BY SIMILARITY.
FT DISULFID    359    372       BY SIMILARITY.
FT DISULFID    378    390       BY SIMILARITY.
FT DISULFID    386    399       BY SIMILARITY.
FT DISULFID    401    414       BY SIMILARITY.
FT DISULFID    420    429       BY SIMILARITY.
FT DISULFID    440    454       BY SIMILARITY.
FT DISULFID    460    473       BY SIMILARITY.
FT DISULFID    469    482       BY SIMILARITY.
FT DISULFID    484    498       BY SIMILARITY.
FT DISULFID    504    517       BY SIMILARITY.
FT DISULFID    511    526       BY SIMILARITY.
FT DISULFID    531    553       BY SIMILARITY.
FT CARBOHYD    173    173       N-LINKED (GLCNAC...) (POTENTIAL).
FT VARSPLIC    542    681       RPRVDRADIIRCVKSCQHNDISCVLNPILSHSHTAISLPTF
FT                              REFNKPEEIVFLRSPTPTHLPHMDSPEIVYDILEGNIQNSF
FT                              DIIKRLDHGMIVGVVKQVRPLVGPVRTVLKLAMNYVTNGVV
FT                              SHRNIINVRIYVSEFWF -> RCERLSCNESNECMAFTRRI
FT                              TYYQLTFPAKIPVPTDLFRMGPSNTALGDDIEVAIVDGNRD
FT                              GFFAAKRLDHGGVLVLQKPIAWPQDFQIALEMKLKRFGHLS
FT                              IYLFKIRPVRHARRHQQRY (in isoform C).
FT                              /FTId=VSP_007379.
SQ SEQUENCE 681 AA; 74459 MW; 175C966305A46699 CRC64;



Query Match 24.9%; Score 631; DB 1; Length 681;

Best Local Similarity 32.6%; Pred. No. 1.4e-39;

Matches 153; Conservative 65; Mismatches 162; Indels 90; Gaps 22;



Qy 16 CLPSPGNAQAQCTNGFDLDRQSGQCLDIDECRTIPEACRGDMMCVNQNGGYLCIPRTNPV 75

I I : II II f I 1=11=11 - = II II I I := I
Db 265 CRP RMQCAAGFIQD-ALGSCIDINECVSVTALSRG-QMCFNTVGSFICQRHS 314

Qy 76 YRGPYSNPYSTPYSGPYPAAAPPLSAPNYPTISRPLICRFGYQMD-ESNQCVDVDECATD 134

= 1 II - 1 :]li:|lll
Db 315 VTCGRGYHLNAEGTRCVDIDECAGP 339



Qy 135 SHQCN PTQ I C I NTEGGYTCSCTDG YWL - - LEGQCLD I DECRYGY CQQLCANVPGSY 188 

1 III t I II h =1 Mllll I I I h Ill 

Db 340 DNS CD - GHGC I NLVGS YRCECRTGF I FNS I SRSCED I DECR - N YPGRLCAHKCEN I LGS Y 3 97 

Qy 189 S CTCNPGFTLNEDGRS CQDVNE CATENP CVQTCVNTYGSF I CRCDPGYELEE - DGVHCSD 247 

[ II I :|||:| Mill : =11 I I I llh I Ihl = Ih II 

Db 398 KCSCTAGFKLADDGRNCDDVNECES - SPCSQGCANVYGSYQSYCRRGYQLSDADGI TCED 456 

Qy 248 MDECSF- -SEFLCQHECVNQPGTYFCSCP- PGYILLDDNRSCQDINECEHRNHTCNLQQT 3 04 

: I = I I I I : : I - I I Ml = II Nihil hh 
Db 457 IDECALPTGGHICSYRCHNTPGSFHCTCPASGYTLAANGRSCQDIDECLTGTHSCSESES 516 



Qy 305 CYNLQGGF KC I D PI RCEEP YLR I SDN RCM- - CPAENPGCRDQPFTILYRD 352 

hhlllM: I I I I Ihl =1 III 

Db 517 CFNIQGGFRCLS-FDCPANYRRSGDTRPRVDRADI IRCVKSCQHNDISCVLNP- - ILSHS 573 



Qy 353 MDWS GRSVPAD I FQMQATT RYPGAYYI PQI KSGNEGREFYMRQT GP 399 

:| : | :| ::: | :: | || | : : | 

Db 574 HTAISLPTFREFNKPEEIVFLRSPTPTHLPHMDSPEIVYDILEGNIQNSFDIIKRLDHGM 633 

Qy 400 I SATLVMTR P I KGPRE I QLDLEM - 1 TVNTVT NFRGS S VI RLR I YVSQYPF 448 

I : 11= II I I I - I l== I :=| =llllh: I 

Db 634 IVGWKQVRPLVGPWTVLKI^WYVTNGWSHR--NIINVRIYVSEFWF 681 



Search completed: January 9, 2004, 12:34:05 
Job time : 11.287 secs



GenCore version 5.1.6 
Copyright (c) 1993 - 2004 Compugen Ltd. 



OM protein - protein search, using sw model 



January 9, 2004, 12:32:07 ; Search time 34.4615 Seconds
(without alignments) 
3354.684 Million cell updates/sec 



Title:          US-09-674-379A-13
Perfect score:  2533
Sequence:       1 MPGIKRILTVTILALCLPSP INFRGSSVIRLRIYVSQYPF 448

Scoring table:  BLOSUM62
                Gapop 10.0 , Gapext 0.5

Searched:       830525 seqs, 258052604 residues



Total number of hits satisfying chosen parameters: 

Minimum DB seq length: 0 

Maximum DB seq length: 2000000000 

Post-processing: Minimum Match 0%

Maximum Match 100% 
Listing first 45 summaries 



Database :

SPTREMBL_23:*

1: sp_archea:*
2: sp_bacteria:*
3: sp_fungi:*
4: sp_human:*
5: sp_invertebrate:*
6: sp_mammal:*
7: sp_mhc:*
8: sp_organelle:*
9: sp_phage:*
10: sp_plant:*
11: sp_rodent:*
12: sp_virus:*
13: sp_vertebrate:*
14: sp_unclassified:*
15: sp_rvirus:*
16: sp_bacteriap:*
17: sp_archeap:*



Pred. No. is the number of results predicted by chance to have a 
score greater than or equal to the score of the result being printed, 
and is derived by analysis of the total score distribution. 
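
The percentages printed with each result in this listing appear to follow from the other reported figures: Query Match matches Score divided by the Perfect score, and Best Local Similarity matches Matches divided by the total number of alignment columns. The sketch below is an assumption inferred from the numbers in this report, not taken from GenCore documentation, and the function names are hypothetical.

```python
# Sketch only: relationships inferred from the figures printed in this report,
# not from GenCore documentation. Function names are hypothetical.

def query_match_percent(score, perfect_score):
    """Score as a percentage of the query's perfect (self-match) score."""
    return round(100.0 * score / perfect_score, 1)


def best_local_similarity_percent(matches, conservative, mismatches, indels):
    """Identical positions as a percentage of all alignment columns."""
    columns = matches + conservative + mismatches + indels
    return round(100.0 * matches / columns, 1)


# Figures taken from two results in this listing (perfect score 2533).
print(query_match_percent(1289, 2533))                  # Score 1289
print(best_local_similarity_percent(226, 74, 131, 22))  # Matches 226, ...
print(query_match_percent(743, 2533))                   # Score 743
print(best_local_similarity_percent(160, 65, 154, 64))  # Matches 160, ...
```

The values computed this way can be compared with the "Query Match" and "Best Local Similarity" fields printed above each alignment.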
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2 
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38 
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19 
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6 
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39 
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19 


5 


746 


4 


Q96HB9 


Q96hb9 homo sapien 


40 
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19 


5 
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6 


Q8MJK0 


Q8mjk0 cercopithec 


41 
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19 


5 


1256 


4 


Q9NS15


Q9ns15 homo sapien


42   495    19.5  1382   4  Q9H7K2   Q9h7k2 homo sapien
43   494.5  19.5  1095  11  Q60784   Q60784 mus musculu
44   492.5  19.4  1764  11  O35806   O35806 rattus norv


45 
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19 


4 


517 


4 


Q9NP01 


Q9np01 homo sapien 



ALIGNMENTS 



RESULT 1 
Q96TF5 

ID Q96TF5 PRELIMINARY; PRT; 443 AA.
AC Q96TF5;

DT 01-DEC-2001 (TrEMBLrel. 19, Created)
DT 01-DEC-2001 (TrEMBLrel. 19, Last sequence update)
DT 01-MAR-2003 (TrEMBLrel. 23, Last annotation update)



DE Mutant p53 binding protein 1 (MBP1).

GN MBP1.

OS Homo sapiens (Human).

OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; 

OC Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. 



OX NCBI_TaxID=9606; 

RN [1] 

RP SEQUENCE FROM N.A.

RA Tanaka S.;

RL Submitted (JUL-1999) to the EMBL/GenBank/DDBJ databases.

RN [2]

RP SEQUENCE FROM N.A.

RA Tanaka S., Sugimachi K., Sugimachi K.;

RT "Human mutant p53 binding protein (MBP1).";

RL Submitted (MAR-2000) to the EMBL/GenBank/DDBJ databases. 

DR EMBL; AB030655; BAA92880.1;

DR InterPro; IPR000152; Asx_hydroxyl.

DR InterPro; IPR001881; EGF_Ca.

DR InterPro; IPR006209; EGF_like.

DR InterPro; IPR001491; Thrmbomoduln.

DR Pfam; PF00008; EGF; 4.

DR PRINTS; PR00907; THRMBOMODULN.

DR SMART; SM00179; EGF_CA; 4.

DR PROSITE; PS00010; ASX_HYDROXYL; 4.

DR PROSITE; PS01186; EGF_2; 4.

DR PROSITE; PS01187; EGF_CA; 6.

KW EGF-like domain.

SQ SEQUENCE 443 AA; 49421 MW; 9CE175F4F388A56D CRC64;

Query Match 50.9%; Score 1289; DB 4; Length 443;

Best Local Similarity 49.9%; Pred. No. 7.4e-118;

Matches 226; Conservative 74; Mismatches 131; Indels 22; Gaps 5;
Qy 1 MPGIKRILTVTILALCLPSPGNAQ AQCTNGFDLDRQSGQCLDIDECRTI PEACRG 55 

••II • ••! I II --IM- I I I l-ll lllllhl 

Db 8 LPGSLLLWALLLLLLGSASPQDSEEPDSYTECTDGYEWDPDSQHCRDVNECLTIPEACKG 67 

Qy 56 DMMCVNQNGGYLCI PRTNPVYRGPYSNPYSTPYSGPYPAAAPPLSAPNYPTI SRPLI CRF 115 

, ' llllhlh [ II I II II I 
Db 68 EMKCINHYGGYLCLPRSAAVINDLHG EGP-PPPVPPAQHPN PCPP 111 

Qy 116 GYQMDESNQCVDVDECATDSHQCNPTQI C I NTEGGYTCS CTDGYWLLEGQCLDI DECRYG 175 

II h : llllllli | 1 hi [ | | | |:| III : : I 'I 

Db 112 GYEPDDQDSCVDVDECAQALHDCRPSQDCHNLPGSYQCTCPDGYRKIGPECVDIDECRYR 171 

Qy 176 YCQQLCANVPGSYSCTCNPGFTLNEDGRSCQDVNECATENPCVQTCVNTYGSFICRCDPG 235 

III I V | | III | : Ml IIMI II I I hlhhill I 

Db 172 YCQHRCVNLPGSFRCQCEPGFQLGPNNRSCVDVNECDMGAPCEQRCFNSYGTFLCRCHQG 231 

Qy 236 YELEEDGVHCSDMDECS FS EFLCQHE CVNQ PGTYF CSCP PG Y I LLDDNRS CQDI NECEHR 295 

III II llhlllhl :|||: llhll -.1111111 I llllhll 
Db 232 YELHRDGFS CSD I DECS YSS YLCQYRCVNE PGRFS CHC PQG YQLL - ATRLCQDI DECESG 290 

Qy 296 NHTCNLQQTCYNLQGGFKCIDPIRCEEPYLRISDNRCMCPAENPGCRDQPFTILYRDMDV 355 

I h III I lh = l = l II Hh-hllhllL II Ihll <|::| I : 
Db 291 AHQCSEAQTCVNPHGGYRCVDTNRCVEPYIQVSENRCLCPASNPLCREQPSSIVHRYMTI 350 



Qy 356 VSGRSVPADI FQMQATTRYPGAYYI FQI KSGNEGREFYMRQTGPI SATLVMTRPI KGPRE 415 

I I'l'l :|' M lllll |||:.|| ,1 '| :|| ||: ||: |||| 

Db 351 TS ERS VPADVFQ I QATS VY PGAYNAFQ I RAGNSQGDF Y I RQ INNVSAMLVLARP VTGPRE 410 

Qy 416 I QLDLEM I TVNTVI NFRGS S VI RLR I YVSQYPF 448 

Mll|:|:|:: = -I llhll -III 
Db 411 YVLDLEMVTMNSLMSYRASSVLRLTVFVGAYTF 443 

RESULT 2 
Q9JM06

ID Q9JM06 PRELIMINARY; PRT; 443 AA. 

AC Q9JM06; 

DT 01-OCT-2000 (TrEMBLrel. 15, Created) 

DT 01-OCT-2000 (TrEMBLrel. 15, Last sequence update)

DT 01-MAR-2003 (TrEMBLrel. 23, Last annotation update)

DE EGF-containing fibulin-like extracellular matrix protein 2.

GN EFEMP2.

OS Mus musculus (Mouse).

OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;

OC Mammalia; Eutheria; Rodentia; Sciurognathi; Muridae; Murinae; Mus.

OX NCBI_TaxID=10090;

RN [1] 

RP SEQUENCE FROM N.A. 

RX MEDLINE=20435063; PubMed=10982184;

RA Katsanis N., Venable S., Smith J.R., Lupski J.R.;

RT "Isolation of a paralog of the Doyne honeycomb retinal dystrophy gene

RT from the multiple retinopathy critical region on 11q13.";

RL Hum. Genet. 106:66-72(2000).

DR EMBL; AF109122; AAF65189.1; -.

DR HSSP; P00736; 1APQ.

DR MGD; MGI:1891209; Efemp2.

DR InterPro; IPR000152; Asx_hydroxyl.

DR InterPro; IPR001881; EGF_Ca.

DR InterPro; IPR006209; EGF_like.

DR InterPro; IPR001491; Thrmbomoduln.

DR Pfam; PF00008; EGF; 4.

DR PRINTS; PR00907; THRMBOMODULN.

DR SMART; SM00179; EGF_CA; 4.

DR PROSITE; PS00010; ASX_HYDROXYL; 4.

DR PROSITE; PS01186; EGF_2; 4.

DR PROSITE; PS01187; EGF_CA; 6.

KW EGF-like domain; Matrix protein.

SQ SEQUENCE 443 AA; 49452 MW; 5AEC2A91048B336A CRC64;

Query Match 50.7%; Score 1283; DB 11; Length 443; 

Best Local Similarity 49.9%; Pred. No. 2.9e-117; 

Matches 226; Conservative 70; Mismatches 135; Indels 22; Gaps 4 
Qy 1 MPG I KR I LTVTI LALCL PSPGNAQ AQCTNGFDLDRQSGQCLDIDECRTI PEACRG 55 

: 1 1 = : i i ii ■■ ■■ -ii-i- i i i mil mum 

Db 8 LPGSLLLWAFLLLLLGAASPQDPEEPDSYTECTDGYEWDADSQHCRDVNECLTI PEACKG 67 

Qy 56 DMMCVNQNGGYLCIPRTNPVYRGPYSNPYSTPYSGPYPAAAPPLSAPNYPTISRPLICRF 115 

= 1 hi Mllhlh I = II I II :| I 

Db 68 EMKCINHYGGYLCLPRSAAVINDLHG EGPPPPAA HAQQPNPCPQ 111 



Qy 116 GYQMDESNQCVDVDECATDSHQCNPTQI CINTEGGYTCSCTDGYWLLEGQCLDI DECRYG 175 

! (I IMNII I | |:| | I II hi Ml : --1:1111111 

Db 112 GYE PDEQES CVDVDE CTQALHD CRPSQDCHNLPGS YQCTCPDGYRKI G PECVD I DECRYR 171 

Qy 176 YCQQLCANVPGSYSCTCNPGFTLNEDGRSCQDVNECATENPCVQTCVNTYGSFICRCDPG 235 

III I I : I I I • I I III I III lllll II I I MM: I Ih I 

Db 172 YCQHRCVNLPGSPRCQCEPGFQLGPNNRSCVDVNECDMGAPCEQRCFNSYGTFLCRCNQG 231 

Qy 236 YELEEDGVHCSDMDECS FSEFLCQHECVNQPGT YFCS CPPG Y I LLDDNRSCQD I NECEHR 295 

III II I I I : I I I : I - I I h I I I • I I ; i II II II M llhll I 
Db 232 YELHRDGFS CSD I DECGYSS YLCQYRCVNEPGRFS CHCPQG YQLL - ATRLCQD I DECETG 290 

Qy 296 NHTCNLQQTCYNLQGGFKCI DPIRCEEPYLRI SDNRCMCPAENPGCRDQPFTILYRDMDV 355 

I 1= III I IMM II |||:::||||hlll II Ihll =MI I * 
Db 291 AHQCSEAQTCVNFHGGYRCVDTNRCVEPYVQVSDNRCLCPASNPLCREQPSSIVHRYMSI 350 

Qy 356 VSGRS VPADI FQMQATTRYPGAYYI FQI KSGNEGREFYMRQTGP I SATLVMTRP I KG PRE 415 

1 lllllhlhlll: lllll IIMII :||:!l =11 Ih lh llll 
Db 351 TS ERS VPADVFQ I QATS VYPGAYNAFQ I RSGNTQGDF Y I RQI NN VSAMLVLAR PVTG PRE 410 

Qy 416 I QLDLEM I TVNTVT NFRGSS VI RLR I Y VSQYPF 448 

lllll-hl"'.'! llhll -I I I 

Db 411 YVLDLEMVTMNSLMSYRASSVLRLTVFVGAYTF 443 



RESULT 3 
Q9H3D5 

ID Q9H3D5 PRELIMINARY; PRT; 443 AA. 

AC Q9H3D5; 

DT 01-MAR-2001 (TrEMBLrel. 16, Created)

DT 01-MAR-2001 (TrEMBLrel. 16, Last sequence update) 

DT 01-MAR-2003 (TrEMBLrel. 23, Last annotation update) 

DE Fibulin-like extracellular matrix protein. 

OS Homo sapiens (Human).

OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;

OC Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo.

OX NCBI_TaxID=9606;

RN [1] 

RP SEQUENCE FROM N.A. 

RA Seibold S., Marx M.;

RT "Cloning of a new fibulin-like gene.";

RL Submitted (JAN-1999) to the EMBL/GenBank/DDBJ databases.

DR EMBL; AF124486; AAG45245.1; -.

DR HSSP; P35555; 1EMN.

DR InterPro; IPR000152; Asx_hydroxyl.

DR InterPro; IPR001881; EGF_Ca.

DR InterPro; IPR006209; EGF_like.

DR InterPro; IPR001491; Thrmbomoduln.

DR Pfam; PF00008; EGF; 3.

DR PRINTS; PR00907; THRMBOMODULN.

DR SMART; SM00179; EGF_CA; 4.

DR PROSITE; PS00010; ASX_HYDROXYL; 4.

DR PROSITE; PS01186; EGF_2; 4.

DR PROSITE; PS01187; EGF_CA; 5.

KW EGF-like domain; Matrix protein.

SQ SEQUENCE 443 AA; 49535 MW; D91784BF36A8A060 CRC64;



Query Match 50.0%; Score 1266; DB 4; Length 443; 

Best Local Similarity 49.2%; Pred. No. 1.3e-115;

Matches 223; Conservative 74; Mismatches 134; Indels 22; Gaps 5; 



Qy 1 MPG I KRI LTVTI LALCLPS PGNAQ AQCTNGFDLDRQSGQCLDIDECRTI PEACRG 55 

- i I : =i I I! HI'h: I I I l-ll lllllhl 

Db 8 LPGSLLLWALLLLLLGSASFQDSEEPDSYTECTDGYEWDPDSQHCRDVNECLTIPEACKG 67 

Qy 56 DMMCVNQNGGYLCIPRTNPVYRGPYSNPYSTPYSGPYPAAAPPLSAPNYPTISRPLICRF 115 

• I I I I I 1 I I I I : I ■• III II II i 
Db 68 EMKCINHYGGYLCLPRSAAVINDLHG EGP-PPPVPPAQHPN PCPP 111 

Qy 116 GYQMDESNQCVDVDE CATDSHQCNPTQ I C I NTEGG YTCS CTDGYWLLEGQCLD I DECRYG 175 

,1 I : IIIIMII I hi [ I I I hi III : .Ml'., 
Db 112 GYEPDDQDSCVDVDECAQALHDRRPSQDCHNLSGSYQCTCPDGYRKIGPECVDIDECRYR 171 

Qy 176 YCQQLCANVPGSYSCTCNPGFTLNEDGRSCQDVWECATENPCVQTCVWTYGSFICRCDPG 235 

III I 1 I I III I = 111 Mill II I I hlhhlll I 
Db 172 YCQHRCVNLPGSFRCQCEPGFQLGPNNRSCVDVWECDMGAPCEQRCFNSYGTFLCRCHOG 231 

Qy 23 6 YELEEDGVHCSDMDECSFS EFLCQHECVNQPGT YF CSCP PGY I LLDDNRSCQD I NE CEHR 295 

III II MMIIH :||h 111 = 11 = 1111111 I lllhlli 
Db 232 YELHRDGFSCSDIDECSYSSYLCQYRCVNEPGRFSCHCPQGYQLL-ATRLCQDIDECESG 2 90 

Qy 296 NHTCNLQQTCYNLQGGFKCIDPIRCEEPYLRISDNRCMCPAENPGCRDQPFTILYRDMDV 355 

I h III I ll-hl II II Ihll :|::| I = 

Db 291 AHQCSEAQTCVNFHGGYRCVDTWRCVEPYIQVSENRCLCPASMPLCREQPSSIVHRYMTI 350 

Qy 356 VSGRSVPADIFQMQATTRYPGAYYI FQI KSGNEGREFYMRQTGPISATLVMTRPI KGPRE 415 

I I ' I I , ' I Mill M' , , HI II II: I | | I 

Db 351 TSERMRPADVFQIQATSVYPGAYNAFQIRAGNSQGDFYIRQINNVSAMLVLARPVTGPRE 410 

Qy 416 I QLDLEMI TVNTVI WFRGSS VI RLRI YVSQYPF 448 

llil|:|:|:::::| llhll -| I I 
Db 411 YVLDLEMVTMNSLMSYRASSVLRLTVFVGAYTF 443 



RESULT 4 
Q8BPB5 



ID Q8BPB5 PRELIMINARY; PRT; 493 AA. 

AC Q8BPB5;

DT 01-MAR-2003 (TrEMBLrel. 23, Created)

DT 01-MAR-2003 (TrEMBLrel. 23, Last sequence update)

DT 01-MAR-2003 (TrEMBLrel. 23, Last annotation update)

DE Epidermal growth factor-containing fibulin-like extracellular matrix 

DE protein 1. 



OS Mus musculus (Mouse).

OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; 

OC Mammalia; Eutheria; Rodentia; Sciurognathi; Muridae; Murinae; Mus. 

OX NCBI_TaxID=10090; 

RN [1] 

RP SEQUENCE FROM N.A. 

RC STRAIN=C57BL/6J; TISSUE=Xiphoid cartilage; 

RX MEDLINE=22354683; PubMed=12466851;

RA The FANTOM Consortium,

RA the RIKEN Genome Exploration Research Group Phase I & II Team;

RT "Analysis of the mouse transcriptome based on functional annotation of

RT 60,770 full-length cDNAs.";

RL Nature 420:563-573(2002).

DR EMBL; AK077302; BAC36738.1;

SQ SEQUENCE 493 AA; 54952 MW; 9CEDC7BF2FF9430F CRC64;

Query Match 43.4%; Score 1100; DB 11; Length 493;

Best Local Similarity 42.0%; Pred. No. 2.8e-99;

Matches 206; Conservative 72; Mismatches 161; Indels 52; Gaps 6;

Qy 8 LTVTILALCLPSPGNAQ AQCTNGFDLDRQSGQCLDIDECRTIPEACRGDMMCVNQ 62

I I ' I I = llhl- 1 II Mill = hlhl I III 

Db 5 LFLTMLTLALVKSQYTEETITYTQCTDGYEWDPIRQQCKDIDECDIVPDACKGGMKCVNH 64 

Qy 63 NGGYLCI PRTNPVYRGPYSNPYSTPYS GPYPAAAPP 98 

llllhhl : || : I = I * I 

Db 65 YGGYLCLPKTAQI I VNNEHPQQETPAAEASSGATTGTVAARSMATSGWPGGGFMASATA 124 

Qy 99 LSAPNYPT ISRPLICRFGYQMDESNQCVDVDECATDSHQC 138 

-•II | : | ih I I I hill : :| I 

Db 125 VAGPEVQTGRNWFVIRRNPADPQRIPSNPSHRIQCAAGYEQSEHNVCQDIDECTSGTHNC 184 

Qy 139 NPTQ I C I NTEGGYTCS CTDGYWLLEGQCLD I DECRY - GYCQQLCANVPGS YS CTCN PGFT 197 

hill I : 1 1 I II Ihlllll II I I I llh I hill 

Db 185 RTDQVC I NLRGSFTCQCLPGYQKRGEQC VD I DECTVP P YCHQRCVNTPGSF YCQCS PGFQ 244 

Qy 198 LNEDGRSCQDVNECATENPCVQTCVNTYGSFICRCDPGYELEEDGVHCSDMDECSFSEFL 257 

I : hill I I I I I llllhh I I I I I -I hill I ' 

Db 245 LAANNYTCVDINECDASNQCAQQCYNILGSFICQCNQGYELSSDRLNCEDIDECRTSSYL 304 

Qy 258 CQHE CVNQ PGTYFCS CP PGY I LLDDNRSCQD I NECEHRNHTCNLQQTCYNLQGGFKCI DP 317 

ll-IIMI : I II II - H'llllllll II = hi llhl 
Db 305 CQYQCVNE PGKFS CMCPQGYE W- RSRTCQD I NECETTNE - CREDEMCWN YHGGFRCYPR 362 

Qy 318 IRCEEPYLRISDNRCMCPAENPGCRDQPFTILYRDMDWSGRSVPADIFQMQATTRYPGA 377 

I-- h I •■III: I I I I I * I -hi- I - I I I I I • i I I I • I i I I 
Db 363 NPCQDHYVLTS ENRCVCP VSNTMCRELPQS I VYKYMS I RSDRS VPSDI FQI QATMI YANT 422 

Qy 378 YYI FQI KSGNEGREFYMRQTGPI SATLVMTRPI KGPREI QLDLEMITVNTVTNFRGSSVI 437 

hllllll Mhlll hll Ih : llll : I I I h I h : : il llh 
Db 423 INTFRIKSGNENGEFYLRQTSPVSAMLVLVKSLSGPREYIVDLEMLTVSSIGTFRTSSVL 482 

Qy 438 RLRIYVSQYPF 448 

II II I 

Db 483 RLTIIVGPFSF 493 

RESULT 5 
Q8K0J4 

ID Q8K0J4 PRELIMINARY; PRT; 387 AA. 

AC Q8K0J4; 

DT 01-OCT-2002 (TrEMBLrel. 22, Created)

DT 01-OCT-2002 (TrEMBLrel. 22, Last sequence update) 

DT 01-MAR-2003 (TrEMBLrel. 23, Last annotation update) 

DE Hypothetical protein. 

OS Mus musculus (Mouse).

OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; 
OC Mammalia; Eutheria; Rodentia; Sciurognathi; Muridae; Murinae; Mus. 



OX NCBI_TaxID=10090;

RN [1] 

RP SEQUENCE FROM N.A. 

RC TISSUE=Colon; 

RA Strausberg R.;

RL Submitted (JUN-2002) to the EMBL/GenBank/DDBJ databases.

DR EMBL; BC031134; AAH31184.1; -.

DR InterPro; IPR000152; Asx_hydroxyl.

DR InterPro; IPR001881; EGF_Ca.

DR InterPro; IPR006209; EGF_like.

DR InterPro; IPR006210; IEGF.

DR Pfam; PF00008; EGF; 3.

DR SMART; SM00181; EGF; 5.

DR SMART; SM00179; EGF_CA; 5.

DR PROSITE; PS00010; ASX_HYDROXYL; 4.

DR PROSITE; PS01186; EGF_2; 4.

DR PROSITE; PS01187; EGF_CA; 5.

KW Hypothetical protein.

SQ SEQUENCE 387 AA; 43334 MW; EE88DC9D1422C1C7 CRC64;

Query Match 37.9%; Score 960.5; DB 11; Length 387;

Best Local Similarity 48.3%; Pred. No. 1e-85;

Matches 171; Conservative 56; Mismatches 122; Indels 5; Gaps 4; 

Qy 96 A P PLSAPN YPT I SR PL I CRFGYQMDESNQCVDVDECATDSHQCN PTQ I C I NTEGGYTCS C 155 

I ! hi 1 =1 I h 11! hill : = 1 I hill I :|| I 
Db 38 ADPQRIPSNP- - SHRIQCAAGYEQSEHNVCQDIDECTSGTHNCRTDQVCINLRGSFTCQC 95 

Qy 156 TDGYWLLEGQCLDI DECRY -GYCQQLCANVPGSYSCTCNPGFTLNEDGRSCQDVHECATE 214 

I I Ihlllll II I I I llh I hill I : ' hill 

Db 96 LPGYQKRGEQCVDIDECTVPPYCHQRCVNTPGSFYCQCSPGFQIiAANNYTCVDINECDAS 155 

Qy 215 NPCVQTCVNTYGSFICRCDPGYELEEDGVHCSDMDECSFSEFLCQHECVNQPGTYFCSCP 274 

Mill llllhh Mil I ::| hill I =llh:|lhll : I II 
Db 156 NQCAQQCYNILGSFICQCNQGYELSSDRLNCEDIDECRTSSYLCQYQCVNEPGKFSCMCP 215 

Qy 275 PGY I LLDDNRS CQD I NECEHRNHTCNLQQTCYNLQGGFKCI DPI RCEE P YLR I SDNRCMC 334 

li ■•(••llllllll I I •• |:| llh-l h hilhl 

Db 216 QGYEW- RS RTCQD I NECETTNE - CR EDEMCWN YHGGFRCYPRNP CQDH YVLTSENRCVC 273 

Qy 335 PAENPGCRDQPFTILYRDMDWSGRSVPADIFQMQATTRYPGAYYIFQIKSGNEGREFYM 394 

I I I h I = hh I : 1 ilihlllhlll 1 hlllill llh 

Db 274 PVSNTMCRELPQSIVYKYMSIRSDRSVPSDIFQIQATMIYANTINTFRIKSGNENGEFYL 333 

Qy 395 RQTGPISATLVMTRPIKGPREIQLDLEM1TVNTVINFRGSSVIRLRIYVSQYPF 448 

III hll Ih : = llll : I I I h I h : : II llhll 11*1 
Db 334 RQTSPVSAMLVLVKSLSGPREYIVDLEMLTVSSIGTFRTSSVLRLTIIVGPFSF 387 

RESULT 6 
Q922K8 

ID Q922K8 PRELIMINARY; PRT; 685 AA. 

AC Q922K8; 

DT 01-DEC-2001 (TrEMBLrel. 19, Created)
DT 01-DEC-2001 (TrEMBLrel. 19, Last sequence update)
DT 01-MAR-2003 (TrEMBLrel. 23, Last annotation update)
DE Similar to fibulin 1.

GN FBLN1.

OS Mus musculus (Mouse).

OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; 

OC Mammalia; Eutheria; Rodentia; Sciurognathi; Muridae; Murinae; Mus. 

OX NCBI_TaxID=10090; 

RN [1] 

RP SEQUENCE FROM N.A. 

RA Strausberg R.;

RL Submitted (APR-2001) to the EMBL/GenBank/DDBJ databases. 

RN [2] 

RP SEQUENCE FROM N.A. 

RC STRAIN=C57BL/6J; TISSUE=Urinary bladder;

RX MEDLINE=22354683; PubMed=12466851;

RA The FANTOM Consortium,

RA the RIKEN Genome Exploration Research Group Phase I & II Team;

RT "Analysis of the mouse transcriptome based on functional annotation of

RT 60,770 full-length cDNAs.";

RL Nature 420:563-573(2002).

DR EMBL; BC007140; AAH07140.1; -.

DR EMBL; AK035388; BAC29054.1; -.

DR MGD; MGI:95487; Fbln1.

DR InterPro; IPR000020; Anaphylatoxin.

DR InterPro; IPR000152; Asx_hydroxyl.

DR InterPro; IPR001881; EGF_Ca.

DR InterPro; IPR006209; EGF_like.

DR Pfam; PF01821; ANATO; 3.

DR Pfam; PF00008; EGF; 6.

DR SMART; SM00104; ANATO; 3.

DR SMART; SM00179; EGF_CA; 8.

DR PROSITE; PS01177; ANAPHYLATOXIN_1; 3.

DR PROSITE; PS01178; ANAPHYLATOXIN_2; 3.

DR PROSITE; PS00010; ASX_HYDROXYL; 4.

DR PROSITE; PS01186; EGF_2; 3.

DR PROSITE; PS01187; EGF_CA; 8.

KW EGF-like domain.

SQ SEQUENCE 685 AA; 75283 MW; EF0D77D7F66B73B8 CRC64; 

Query Match 29.3%; Score 743; DB 11; Length 685; 

Best Local Similarity 36.1%; Pred. No. 4.1e-64; 

Matches 160; Conservative 65; Mismatches 154; Indels 64; Gaps 17; 

Qy 16 CLPSPGNAQAQCTNGFDLDRQSGQCLDIDECRTI PEACRGDMMCVNQNGGYLCI PRTNP V 75 

II = II =11 I I hlhll =11 1 = 1 I I I 
Db 290 CRP KLQCKSGFIQD-ALGNCIDINECLSISAPCPVGQTCINTEGSYTC 336 

Qy 76 YRGPYSNPYSTPYSGPYPAAAPPLSAPNYPTISRPLICRFGYQM-DESNQCVDVDECATD 134 

= 11 I I! : =1 = 1111111 = 

Db 337 QKNVPN CGRGYHLNEEGTRCVDVDECSPP 365 

Qy 135 SHQCNPTQICINTEGGYTCSCTDGYWL- -LEGQCLDIDEC-RY- -GYCQQLCANVPGSYS 189 

= 1 I ■ I ■• I • II I ■■ = : 1 = 11 = 11 II I I I III- 

Db 366 AEPCGKGHHCLNSPGSFRCECKAGFYFDGISRTCVDINECQRYPGRLCGHKCENTPGSFH 425 



Qy 190 CTCNPGFTLNEDGRSCQDVNECATENPCVQTCVNTYGSFICRCDPGYELEE-DGVHCSDM 248 

11 II i IIIIMIIII =11111 111= I I l! = ! = III I 1 = 

Db 426 CSCSAGFRLSVDGRSCEDVNEC-LNSPCSQECANVYGSYQCYCRRGYQLSDVDGVTCEDI 484



Qy 249 DECSF--SEFLCQHECVNQPGTYFCSCP-PGYILLDDNRSCQDINECEHRNHTCNLQQTC 305

III: '-I ■• hi II- III! II I •■ hlllhll I I-- --II 

Db 485 DECALPTGGHICSYRCINIPGSFQCSCPSSGYRLAPNGRNCQDIDECVTGIHNCSINETC 544 

Qy 306 YNLQGGFKCIDPIRCEEPYLRISDNRC-MCPA-ENPGCRDQPFTILYRDMDVVSGRSVPA 363

■•Ml hi-. I I I I -I II I II I I I I ; III 

Db 545 FNIQGSFRCLS-FECPENYRRSADTRCERLPCHENQECPRLPLRITYYHLSFPTNIQVPA 603 

Qy 364 DIFQMQATTRYPGAYYIFQIKSGNEGREFYMRQTGPISATLVMTRPIKGPREIQLDLEM- 422

= hl II I :||| I h I - : I I I II-- I I 

Db 604 VVFRMGPSSAVPGDSMQLAITAGNEEGFFTTRKVSHHSGVVALTKPIPEPRDLLLTVKMD 663

Qy 423 -ITVNTVINFRGSSVIRLRIYVS 444 

n =r nihil 

Db 664 LYRHGTVSSF VAKLFIFVS 682 



RESULT 7 
Q8C3B1 

ID Q8C3B1 PRELIMINARY; PRT; 685 AA. 

AC Q8C3B1; 

DT 01-MAR-2003 (TrEMBLrel. 23, Created)

DT 01-MAR-2003 (TrEMBLrel. 23, Last sequence update)

DT 01-MAR-2003 (TrEMBLrel. 23, Last annotation update)

DE Fibulin 1.

OS Mus musculus (Mouse).

OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;

OC Mammalia; Eutheria; Rodentia; Sciurognathi; Muridae; Murinae; Mus.

OX NCBI_TaxID=10090;

RN [1] 

RP SEQUENCE FROM N.A. 

RC STRAIN=C57BL/6J; TISSUE=Head; 

RX MEDLINE=22354683; PubMed=12466851;

RA The FANTOM Consortium, 

RA the RIKEN Genome Exploration Research Group Phase I & II Team; 

RT "Analysis of the mouse transcriptome based on functional annotation of

RT 60,770 full-length cDNAs.";

RL Nature 420:563-573(2002).

DR EMBL; AK086451; BAC39669.1; -.

SQ SEQUENCE 685 AA; 75282 MW; E38377D35B08C560 CRC64; 

Query Match 29.3%; Score 743; DB 11; Length 685; 

Best Local Similarity 36.1%; Pred. No. 4.1e-64; 

Matches 160; Conservative 65; Mismatches 154; Indels 64; Gaps 17; 

Qy 16 CLPSPGNAQAQCTNGFDLDRQSGQCLDIDECRTIPEACRGDMMCVNQNGGYLCIPRTNPV 75 

I 1 : 1 I : I I I I hlhll -I I 1 = 1 I 1 I 
Db 290 CRP KLQCKSGFIQD-ALGNCIDINECLSISAPCPVGQTCINTEGSYTC 336

Qy 76 YRGPYSNPYSTPYSGPYPAAAPPLSAPNYPTISRPLICRFGYQM-DESNQCVDVDECATD 134 

■> li h II = :| Olllllh 

Db 337 QKNVPN CGRGYHLNEEGTRCVDVDECSPP 365 

Qy 135 SHQCNPTQICINTEGGYTCSCTDGYWL--LEGQCLDIDEC-RY--GYCQQLCANVPGSYS 189

= I hh i = II i- = hlhll II I I I llh 

Db 366 AEPCGKGHHCLNSPGSFRCKCKAGFYFDGISRTCVDINECQRYPGRLCGHKCENTPGSFH 425 



Qy 190 CTCNPGFTLNEDGRSCQDVNECATENPCVQTCVNTYGSFICRCDPGYELEE-DGVHCSDM 248

I) ]: lltlhiiili Hi I I I !lh I I INI : III I h 
Db 426 CSCSAGFRLSVDGRSCEDVNEC-LNSPCSQECANVYGSYQCYCRRGYQLSDVDGVTCEDI 484

Qy 249 DECSF--SEFLCQHECVNQPGTYFCSCP-PGYILLDDNRSCQDINECEHRNHTCNLQQTC 305

llh =| = hi lh= III! II I = hlllhll I h = =11 

Db 485 DECALPTGGHICSYRCINIPGSFQCSCPSSGYRLAPNGRKCQDIDECVTGIHNCSINETC 544 

Qy 306 YNLQGGFKCIDPIRCEEPYLRISDNRC-MCPA-ENPGCRDQPFTILYRDMDVVSGRSVPA 363

'I'll l = h I I I I =1 II I II I |||: : III 

Db 545 FNIQGSFRCLS-FECPENYRRSADTRCERLPCHENQECPRLPLRITYYHLSFPTNIQVPA 603

Qy 364 DIFQMQATTRYPGAYYIFQIKSGNEGREFYMRQTGPISATLVMTRPIKGPREIQLDLEM- 422 

: I: I II I =111 I h I -hll II- I -| 

Db 604 VVFRMGPSSAVPGDSMQLAITAGNEEGFFTTRKVSHHSGVVALTKPIPEPRDLLLTVKMD 663

Qy 423 -ITVNTVINFRGSSVIRLRIYVS 444 

II , I =1 hll 
Db 664 LYRHGTVSSF VAKLFIFVS 682 



RESULT 8 
Q8TBH8 

ID Q8TBH8 PRELIMINARY; PRT; 683 AA. 

AC Q8TBH8;

DT 01-JUN-2002 (TrEMBLrel. 21, Created)

DT 01-JUN-2002 (TrEMBLrel. 21, Last sequence update)

DT 01-MAR-2003 (TrEMBLrel. 23, Last annotation update)

DE Fibulin 1.

OS Homo sapiens (Human).

OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;

OC Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. 

OX NCBI_TaxID=9606; 

RN [1] 

RP SEQUENCE FROM N.A. 

RC TISSUE=Brain; 

RA Strausberg R.;

RL Submitted (FEB-2002) to the EMBL/GenBank/DDBJ databases. 

DR EMBL; BC022497; AAH22497.1; -.

DR InterPro; IPR000020; Anaphylatoxin.

DR InterPro; IPR000152; Asx_hydroxyl.

DR InterPro; IPR001881; EGF_Ca.

DR InterPro; IPR006209; EGF_like.

DR Pfam; PF01821; ANATO; 3.

DR Pfam; PF00008; EGF; 6. 

DR SMART; SM00104; ANATO; 3.

DR SMART; SM00179; EGF_CA; 8.

DR PROSITE; PS01177; ANAPHYLATOXIN_1; 3.

DR PROSITE; PS01178; ANAPHYLATOXIN_2; 3.

DR PROSITE; PS00010; ASX_HYDROXYL; 4.

DR PROSITE; PS01186; EGF_2; 3.

DR PROSITE; PS01187; EGF_CA; 8.

KW EGF-like domain.

SQ SEQUENCE 683 AA; 74423 MW; 2665A3961B6403B4 CRC64;



Query Match 29.1%; Score 737; DB 4; Length 683; 

Best Local Similarity 35.4%; Pred. No. 1.6e-63; 



Matches 157; Conservative 67; Mismatches 154; Indels 66; Gaps 17; 
Qy 16 CLPSPGNAQAQCTNGFDLDRQSGQCLDIDECRTIPEACRGDMMCVNQNGGYLCIPRTNPV 75

I i •11-11 I I I - 1 i • I i 1 I hi IN 

Db 288 CRP KLQCKSGFIQD-ALGNCIDINECLSISAPCPIGHTCINTEGSYTC 334

Qy 76 YRGPYSNPYSTPYSGPYPAAAPPLSAPNYPTISRPLICRFGYQM-DESNQCVDVDECATD 134

il I || : :| '-11111111 

Db 335 QKNVPN CGRGYHLNEEGTRCVDVDECAPP 363 

Qy 135 SHQCNPTQICINTEGGYTCSCTDGYWL--LEGQCLDIDEC-RY--GYCQQLCANVPGSYS 189

= I hh I'll II* : hl-ll II III 111 

Db 364 AEPCGKGHRCVNSPGSFRCECKTGYYFDGISRMCVDVNECQRYPGRLCGHKCENTLGSYL 423

Qy 190 CTCNPGFTLNEDGRSCQDVNECATENPCVQTCVNTYGSFICRCDPGYELEE-DGVHCSDM 248

hh II |: I III hh III- Ml I I I 111= I I I hi = III i h 
Db 424 CSCSVGFRLSVDGRSCEDINECSS-SPCSQECANVYGSYQCYCRRGYQLSDVDGVTCEDI 482 

Qy 249 DECSF--SEFLCQHECVNQPGTYFCSCP-PGYILLDDNRSCQDINECEHRNHTCNLQQTC 305

llh :| : hi lh: llll II I = =lllhll I h= Ml 

Db 483 DECALPTGGHICSYRCINIPGSFQCSCPSSGYRLAPNGSNCQDIDECVTGIHNCSINETC 542 

Qy 306 YNLQGGFKCIDPIRCEEPYLRISDNRC-MCPA-ENPGCRDQPFTILYRDMDVVSGRSVPA 363

= hlllhh M II : II I II I I M : : 11 

Db 543 FNIQGGFRCL-AFECPENYRRSAATRCERLPCHENRECSKLPLRITYYHLSFPTNIQAPA 601 

Qy 364 DIFQMQATTRYPGAYYIFQIKSGNEGREFYMRQTGPISATLVMTRPIKGPREIQLDLEMI 423

= hl == II llll I h M = = hh II- 
Db 602 VVFRMGPSSAVPGDSMQLAITGGNEEGFFTTRKVSPHSGVVALTKPVPEPRDL LL 656

Qy 424 TVNTVINFRG SSVIRLRIYVS 444 

II I I M hll 

Db 657 TVKMDLSRHGTVSSFVAKLFIFVS 680 

RESULT 9 
Q99K58 

ID Q99K58 PRELIMINARY; PRT; 1174 AA. 

AC Q99K58; 

DT 01-JUN-2001 (TrEMBLrel. 17, Created)

DT 01-JUN-2001 (TrEMBLrel. 17, Last sequence update)

DT 01-MAR-2003 (TrEMBLrel. 23, Last annotation update)

DE Similar to fibulin 2.

GN FBLN2.

OS Mus musculus (Mouse).

OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;

OC Mammalia; Eutheria; Rodentia; Sciurognathi; Muridae; Murinae; Mus.

OX NCBI_TaxID=10090; 

RN [1] 

RP SEQUENCE FROM N.A. 

RA Strausberg R.;

RL Submitted (MAR-2001) to the EMBL/GenBank/DDBJ databases.

DR EMBL; BC005443; AAH05443.1; -.

DR HSSP; P00736; 1APQ.

DR MGD; MGI:95488; Fbln2.

DR InterPro; IPR000020; Anaphylatoxin.

DR InterPro; IPR000152; Asx_hydroxyl.

DR InterPro; IPR001881; EGF_Ca.

DR InterPro; IPR006209; EGF_like.

DR Pfam; PF01821; ANATO; 2.

DR Pfam; PF00008; EGF; 6. 

DR SMART; SM00104; ANATO; 3.

DR SMART; SM00179; EGF_CA; 9.

DR PROSITE; PS01177; ANAPHYLATOXIN_1; 3.

DR PROSITE; PS01178; ANAPHYLATOXIN_2; 3.

DR PROSITE; PS00010; ASX_HYDROXYL; 5.

DR PROSITE; PS01186; EGF_2; 5.

DR PROSITE; PS01187; EGF_CA; 9.

KW EGF-like domain.

SQ SEQUENCE 1174 AA; 126460 MW; 8D628AC710FBA6B8 CRC64;



Query Match 29.0%; Score 735.5; DB 11; Length 1174; 

Best Local Similarity 35.3%; Pred. No. 4.2e-63; 

Matches 146; Conservative 61; Mismatches 148; Indels 59; Gaps 11; 

Qy 23 AQAQCTNGFDLDRQSGQCLDIDECRTIPEACRGDMMCVNQNGGYLCIPRTNPVYRGPYSN 82

h =1 01 I I hlhll -Ml 1=1 t I I 
Db 782 ARQRCMDGF-LQDPEGNCVDINECTSLLEPCRSGFSCINTVGSYTC 826

Qy 83 PYSTPYSGPYPAAAPPLSAPNYPTISRPLICRFGYQM-DESNQCVDVDECATDSHQCNPT 141

Ihl II =! -Illhll I hi 
Db 827 QRNPLVCGRGYHANEEGSECVDVNECETGVHRCGEG 862 

Qy 142 QICINTEGGYTCSCTDGYW--LLEGQCLDIDECRYG YCQQLCANVPGSYSCTCNPGF 196

hi I I I I I h hl-ll |[ | | | | | I hi II 

Db 863 QLCYNLPGSYRCDCKPGFQRDAFGRTCIDVNECWVSPGRLCQHTCENTPGSYRCSCAAGF 922 

Qy 197 TLNEDGRSCQDVNECATENPCVQTCVNTYGSFICRCDPGYELEEDGVHCSDMDECS-FSE 255

I. I h h I I I I I I II M llh I I I hi III hhlih : 
Db 923 LLAADGKHCEDVNECETRR-CSQECANIYGSYQCYCRQGYQLAEDGHTCTDIDECAQGAG 981 

Qy 256 FLCQHECVNQPGTYFCSCP-PGYILLDDNRSCQDINECEHRNHTCNLQQTCYNLQGGFKC 314 

II III Ihl hll II := = llhl-ll I h =lhhll hi 

Db 982 ILCTFRCVNVPGSYQCACPEQGYTMMANGRSCKDLDECALGTHNCSEAETCHNIQGSFRC 1041 

Qy 315 IDPIRCEEPYLRISDNRCMCPAENPGCRD QPFTILYRDMDVVSGRSVPADIFQM 368

I hhh :| I hi I h - =1 III II- 

Db 1042 L-RFDCPPNYVRVSETKC ERTTCQDITECQTSPARITHYQLNFQTGLLVPAHIFRI 1096

Qy 369 QATTRYPGAYYIFQIKSGNEGREFYMRQTGPISATLVMTRPIKGPREIQLDLEM 422

: I I III I h - II- 11-11 

Db 1097 GPAPAFAGDTISLTITKGNEEGYFVTRRLNAYTGVVSLQRSVLEPRDFALDVEM 1150



RESULT 10 
Q8MJJ9 

ID Q8MJJ9 PRELIMINARY; PRT; 598 AA. 

AC Q8MJJ9;

DT 01-OCT-2002 (TrEMBLrel. 22, Created)
DT 01-OCT-2002 (TrEMBLrel. 22, Last sequence update)
DT 01-MAR-2003 (TrEMBLrel. 23, Last annotation update)
DE Fibulin-1c (Fragment).

OS Cercopithecus aethiops (Green monkey) (Grivet).

OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;



OC Mammalia; Eutheria; Primates; Catarrhini; Cercopithecidae; 

OC Cercopithecinae; Cercopithecus.

OX NCBI_TaxID=9534; 

RN [1] 

RP SEQUENCE FROM N.A. 

RA Brooke J.S., Cha J.-H., Eidels L.;

RT "Cloning of monkey fibulin-1c gene.";

RL Submitted (JUN-2001) to the EMBL/GenBank/DDBJ databases.

DR EMBL; AF395659; AAM90567.1; -.

DR InterPro; IPR000020; Anaphylatoxin.

DR InterPro; IPR000152; Asx_hydroxyl.

DR InterPro; IPR001881; EGF_Ca.

DR InterPro; IPR006209; EGF_like.

DR InterPro; IPR006210; IEGF. 



DR Pfam; PF01821; ANATO; 1. 

DR Pfam; PF00008; EGF; 3. 

DR SMART; SM00104; ANATO; 1. 

DR SMART; SM00181; EGF; 9. 

DR SMART; SM00179; EGF_CA; 9. 

DR PROSITE; PS01177; ANAPHYLATOXIN_1; 1.

DR PROSITE; PS01178; ANAPHYLATOXIN_2; 1.

DR PROSITE; PS00010; ASX_HYDROXYL; 4.

DR PROSITE; PS01186; EGF_2; 3.

DR PROSITE; PS01187; EGF_CA; 7. 

FT NON_TER 1 1 

SQ SEQUENCE 598 AA; 65516 MW; 849BF018DF452B02 CRC64;

Query Match 28.7%; Score 728; DB 6; Length 598;

Best Local Similarity 34.8%; Pred. No. 1e-62;

Matches 158; Conservative 76; Mismatches 188; Indels 32; Gaps 16; 



Qy 16 CLPSPGNAQAQ CTNGFDLDRQSGQCLDIDECRTIPEACRGDMMCVNQNGGYLCIPR 71

1 = : 1 = = 1 1 h = 1 = I MM : I 1=11 1 = I I = 

Db 14 9 CINTVGSFRCQRDSSCGTGYEL-TEDNSCKDIDQCESGIHNCLPDFICQNTLGSFRCRPK 207 

Qy 72 TNPVYRGPYSNPYS TPYSGPYPAAAPPLSAPNYPTISRPLICRFGYQMDESN 123 

I = • I : I I I - ) I ii -I 

Db 208 LQCKNGFIQDALANCIDINECLSIVSAPCPTGHTCINTEGSYTQKNVPNCGRGYHLNEEG 267 

Qy 124 QCVDVDECATDSHQCNPTQICINTEGGYTCSCTDGYWL--LEGQCLDIDEC-RY--GYCQ 178

11 = 111 = 1 hhhll 11= : hl-ll 11 1 

Db 268 TRCDVNECAPPAEPCGKGHRCVNSPGSFRCECKTGYYFDGISRMCVDVNECQRYPGRLCG 327 

Qy 179 QLCANVPGSYSCTCNPGFTLNEDGRSCQDVNECATENPCVQTCVNTYGSFICRCDPGYEL 238 

I 1 111 1 = 1 = 11 1= 111110 = 111 = : =11 1 I 1 111= 1 1 11 = 1 
Db 328 HKCENTLGSYVCSCSVGFRLSVDGRSCEDINECSS-SPCSQECANVYGSYQCYCRRGYQL 386

Qy 239 EE-DGVHCSDMDECSF--SEFLCQHECVNQPGTYFCSCP-PGYILLDDNRSCQDINECEH 294 

= 111 1 1 = 111= =1 : 1 = 1 11 = = till 11 1 = hlllhll 

Db 387 SDVDGVTCEDIDECALPTGGHICSYRCINIPGSFQCSCPASGYRLAPNGRNCQDIDECVT 446

Qy 295 RNHTCNLQQTCYNLQGGFKCIDPIRCEEPYLRISDNRC-MCPA-ENPGCRDQPFTILYRD 352

1 1 : = =11 = 1 = 1111:1= 1111= II 1 II 1 1 11 

Db 447 GIHNCSINETCFNIQGGFRCL-AFECPENYRRSAATRCERLPCHENRECSKLPLRITYYH 505

Qy 353 MDVVSGRSVPADIFQMQATTRYPGAYYIFQIKSGNEGREFYMRQTGPISATLVMTRPIKG 412

= : 11 =1:1 = = II I 1 I I 11=11= =|:|: 



Db 506 LSFPTNIQAPAVVFRMGPSSAVPGDSMQLAITGGNEEGFFTTRKVSPHSGVVALTKPVPE 565

Qy 413 PREIQLDLEM--ITVNTVINFRGSSVIRLRIYVS 444

II- I ::| II =1 I =1 HI 

Db 566 PRDLLLTVKMDLYRHGTVSSF VAKLFIFVS 595



RESULT 11 
O73774

ID O73774 PRELIMINARY; PRT; 704 AA.

AC O73774;

DT 01-AUG-1998 (TrEMBLrel. 07, Created)

DT 01-AUG-1998 (TrEMBLrel. 07, Last sequence update)

DT 01-MAR-2003 (TrEMBLrel. 23, Last annotation update)

DE Fibulin-1, isoform D.

GN FBLN1.

OS Gallus gallus (Chicken).

OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;

OC Archosauria; Aves; Neognathae; Galliformes; Phasianidae; Phasianinae; 

OC Gallus. 

OX NCBI_TaxID=9031; 

RN [1] 

RP SEQUENCE FROM N.A. 

RX MEDLINE=99120531; PubMed=9923656;

RA Barth J.L., Argraves K.M., Roark E.F., Little C.D., Argraves W.S.;

RT "Identification of chicken and C. elegans fibulin-1 homologs and 

RT characterization of the C. elegans fibulin-1 gene."; 

RL Matrix Biol. 17:635-646(1998). 

DR EMBL; AF051399; AAC05387.1; -. 

DR HSSP; P00742; 1HCG.

DR InterPro; IPR000020; Anaphylatoxin.

DR InterPro; IPR000152; Asx_hydroxyl.

DR InterPro; IPR001881; EGF_Ca.

DR InterPro; IPR006209; EGF_like. 

DR Pfam; PF01821; ANATO; 2. 

DR Pfam; PF00008; EGF; 6. 

DR SMART; SM00104; ANATO; 3. 

DR SMART; SM00179; EGF_CA; 8. 

DR PROSITE; PS01177; ANAPHYLATOXIN_1; 1.

DR PROSITE; PS01178; ANAPHYLATOXIN_2; 2.

DR PROSITE; PS00010; ASX_HYDROXYL; 5.

DR PROSITE; PS01186; EGF_2; 3.

DR PROSITE; PS01187; EGF_CA; 8.

KW EGF-like domain.

SQ SEQUENCE 704 AA; 78137 MW; D47D5A30D5E42932 CRC64;

Query Match 28.5%; Score 722; DB 13; Length 704; 

Best Local Similarity 34.6%; Pred. No. 4.8e-62; 

Matches 163; Conservative 68; Mismatches 158; Indels 82; Gaps 22; 

Qy 15 LCLPSPGN AQAQCTNGFDLDRQSGQCLDIDECRTIPEACRGDMMCVNQNGGYLCIP 70 

'•\ : II IN | | MMI : I =hl : I I I 

Db 279 ICQNTPGSFRCRPKLQCMNGFIQD-ALGNCIDINECLSTNMPCPAGQICINTDGSYTC-Q 336

Qy 71 RTNPVYRGPYSNPYSTPYSGPYPAAAPPLSAPNYPTISRPLICRFGYQMDE-SNQCVDVD 129

I '\ III ::| HUM 

Db 337 RISP SCGRGYHLNEDGTRCVDVD 359 



Qy 130 ECATDSHQCNPTQICINTEGGYTCSCTDGYW--LLEGQCLDIDECRY GYCQQLCANV 184

II- I =111 I I I I II - hIMII I | I 

Db 360 ECSSSDQPCGEGHVCIUGPGNYRCECKSGYSFDVISRTCIDINECRRYPGRLCAHKCEKT 419 

Qy 185 PGSYSCTCNPGFTLNEDGRSCQDVNECATENPCVQTCVNTYGSFICRCDPGYELEE-DGV 243

1111 ill II ' llllhhlll = =11 I I I 111: I I h:| : 
Db 420 PGSYYCTCTMGFKLSSDGRSCEDLNECES-SPCSQECAimGSYQCYCRRGFQLSDIDGI 478

Qy 244 HCSDMDECSF--SEFLCQHECVNQPGTYFCSCP-PGYILLDDNRSCQDINECEHRNHTCN 300

I Mih :| |:] ||:: |:|| || | : hMlhll I h 

Db 479 SCEDIDECALPTGGHICSFRCINIPGSFQCTCPSTGYRLAPNARNCQDIDECVAETHNCS 538 

Qy 301 LQQTCYNLQGGFKCIDPIRCEEPYLRISDN RCM--CPAENPGC-RDQPFTI 348

:|hi = llll = |: = I I I = I II' I =11 II 

Db 539 FNETCFNIQGGFRCLS-LECPENYRKSGDTVRLEKTDTIRCIKSCRPNDVNCVLDPVHTI 597

Qy 349 LYRDMDVVSGRSV--PADIFQMQA-TTRYPG--AYYIFQIKSGNEGREFYMRQTG 398

•.::-.|- 1=1-1111 I III II I II | 

Db 598 SHTVISLPTFREFTRPEEIIFLRAITPTYPANQADIIFDITEGNLRESFDIIKRYM--DG 655

Qy 399 PISATLVMTRPIKGPREIQLDLEM-ITVNTVINFRGSSVIRLRIYVSQYPF 448

: III II I III ■ h= I : 1 = 11 = 1 I 

Db 656 MTVGWRQVRPIVGPFHAIliKLEMNYVMGGWSHR--NIVNVHIFVSEYWF 704



RESULT 12 
Q9Y3V7 

ID Q9Y3V7 PRELIMINARY; PRT; 576 AA. 

AC Q9Y3V7;

DT 01-NOV-1999 (TrEMBLrel. 12, Created) 

DT 01-NOV-1999 (TrEMBLrel. 12, Last sequence update) 

DT 01-MAR-2003 (TrEMBLrel. 23, Last annotation update) 

DE Hypothetical protein (Fragment).

GN DKFZP586A1519.

OS Homo sapiens (Human) . 

OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; 

OC Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. 

OX NCBI_TaxID=9606; 

RN [1] 

RP SEQUENCE FROM N.A. 

RC TISSUE=Uterus; 

RA Wambutt R., Heubner D., Mewes H.W., Gassenhuber J., Wiemann S.;

RL Submitted (MAR-1999) to the EMBL/GenBank/DDBJ databases.

DR EMBL; AL050095; CAB43267.1; -. 

DR HSSP; P00736; 1APQ. 

DR InterPro; IPR000152; Asx_hydroxyl.

DR InterPro; IPR001881; EGF_Ca.

DR InterPro; IPR006209; EGF_like.

DR Pfam; PF00008; EGF; 6.

DR SMART; SM00179; EGF_CA; 8.

DR PROSITE; PS00010; ASX_HYDROXYL; 4.

DR PROSITE; PS01186; EGF_2; 4.

DR PROSITE; PS01187; EGF_CA; 9.

KW Hypothetical protein; EGF-like domain.

FT NON_TER 1 1 

SQ SEQUENCE 576 AA; 63274 MW; FF2F0E9B185D8AC3 CRC64; 



Query Match 28.1%; Score 711.5; DB 4; Length 576; 

Best Local Similarity 34.3%; Pred. No. 4.1e-61;

Matches 146; Conservative 61; Mismatches 156; Indels 63; Gaps 12; 

Qy 15 LCLPSPGN AQAQCTNGFDLDRQSGQCLDIDECRTIPEACRGDMMCVNQNGGYLCIP 70

I! : h h :| :|| ! ! hlhll - III hi I I I 

Db 172 LCQNTKGSFYCQARQRCMDGF-LQDPEGNCVDINECTSLSEPCRPGFSCINTVGSYTC-- 228

Qy 71 RTNPVYRGPYSNPYSTPYSGPYPAAAPPLSAPNYPTISRPLICRFGYQ-MDESNQCVDVD 129 

Mil I) h = IIIh 
Db 229 QRNPLICARGYHASDDGTKCVDVN 252

Qy 130 ECATDSHQCNPTQICINTEGGYTCSCTDGYW--LLEGQCLDIDECRYG YCQQLCANV 184

II I hi I : I I MM I: hl = M II I I 

Db 253 ECETGVHRCGEGQVCHNLPGSYRCDCKAGFQRDAFGRGCIDVNECWASPGRLCQHTCENT 312

Qy 185 PGSYSCTCNPGFTLNEDGRSCQDVNECATENPCVQTCVNTYGSFICRCDPGYELEEDGVH 244 

III hi III Ih h-IIUI I I M III: I I IM III 

Db 313 LGSYRCSCASGFLLAADGKRCEDVNECEAQR-CSQECANIYGSYQCYCRQGYQLAEDGHT 371

Qy 245 CSDMDECS-FSEFLCQHECVNQPGTYFCSCP-PGYILLDDNRSCQDINECEHRNHTCNLQ 302

1 = 1=111= = II 1 = 1 Ihl hll II : = llhl-ll I h 

Db 372 CTDIDECAQGAGILCTFRCLNVPGSYQCACPEQGYTMTANGRSCKDVDECALGTHNCSEA 431

Qy 303 QTCYNLQGGFKCIDPIRCEEPYLRISDNRCMCPAENPGCRD QPFTILYRDMDVV 356

= lhhll hh I h-l :| II! ||::* 

Db 432 ETCHNIQGSFRCL-RFECPPNYVQVSKTKC ERTTCHDFLECQNSPARITHYQLNFQ 486

Qy 357 SGRSVPADIFQMQATTRYPGAYYIFQIKSGNEGREFYMRQTGPISATLVMTRPIKGPREI 416

= 111111- : I I III I h : : : M ih 

Db 487 TGLLVPAHIFRIGPAPAFTGDTIALNIIKGNEEGYFGTRRLNAYTGVVYLQRAVLEPRDF 546

Qy 417 QLDLEM 422 

Ihll 

Db 547 ALDVEM 552 



RESULT 13 
Q8IUI1 

ID Q8IUI1 PRELIMINARY; PRT; 1231 AA. 

AC Q8IUI1; 

DT 01-MAR-2003 (TrEMBLrel. 23, Created)

DT 01-MAR-2003 (TrEMBLrel. 23, Last sequence update) 

DT 01-MAR-2003 (TrEMBLrel. 23, Last annotation update) 

DE Fibulin 2. 

GN FBLN2.

OS Homo sapiens (Human) . 

OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; 

OC Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. 

OX NCBI_TaxID=9606; 

RN [1] 

RP SEQUENCE FROM N.A. 

RA Li D., Marian A.J., Roberts R.;

RT "Identification of a novel alternatively spliced isoform of human

RT fibulin-2 gene abundantly expressed in heart and genetic evaluation in

RT patients with ARVD.";

RL (In) Unknown A. (eds.);

RL ABSTRACTS OF THE 52ND ANNUAL MEETING OF THE AMERICAN SOCIETY OF HUMAN

RL GENETICS, pp. 323-323, Unknown Publisher (2002).

DR EMBL; AY130458; AAN05435.1; -.

DR EMBL; AY130456; AAN05435.1; JOINED. 

DR EMBL; AY130457; AAN05435.1; JOINED. 

SQ SEQUENCE 1231 AA; 131853 MW; 0A75D1C27F258D48 CRC64; 

Query Match 28.0%; Score 710.5; DB 4; Length 1231; 

Best Local Similarity 34.3%; Pred. No. 1.2e-60;

Matches 146; Conservative 61; Mismatches 156; Indels 63; Gaps 12; 

Qy 15 LCLPSPGN AQAQCTNGFDLDRQSGQCLDIDECRTIPEACRGDMMCVNQNGGYLCIP 70

II : h I = • I ■ I ( I | MM I •••• i II hi Ml 

Db 827 LCQNTKGSFYCQARQRCMDGF-LQDPEGNCVDINECTSLSEPCRPGFSCINTVGSYTC-- 883

Qy 71 RTNPVYRGPYSNPYSTPYSGPYPAAAPPLSAPNYPTISRPLICRFGYQ-MDESNQCVDVD 129

Mi il I: ,1 

Db 884 QRNPLICARGYHASDDGXKCVDVN 907

Qy 130 ECATDSHQCNPTQICINTEGGYTCSCTDGYW--LLEGQCLDIDECRYG YCQQLCANV 184

I I I hi hi I I I I I h ■ I || | | 

Db 908 ECETGVHRCGEGQVCHNLPGSYRCDCKAGFQRDAFGRGCIDVNECWASPGRLCQHTCENT 967 

Qy 185 PGSYSCTCNPGFTLNEDGRSCQDVNECATENPCVQTCVNTYGSFICRCDPGYELEEDGVH 244 

111 hi II I Ih MM! : I | I I llh I I Ihl III 
Db 968 LGSYRCSCASGFLLAADGKRCEDVNECEAQR-CSQECANIYGSYQCYCRQGYQLAEDGHT 1026

Qy 245 CSDMDECS-FSEFLCQHECVNQPGTYFCSCP-PGYILLDDNRSCQDINECEHRNHTCNLQ 302

hhllh = II hi Ihl hll II = = MhhM I h 

Db 1027 CTDIDECAQGAGILCTFRCLNVPGSYQCACPEQGYTMTANGRSCKDVDECALGTHNCSEA 1086

Qy 303 QTCYNLQGGFKCIDPIRCEEPYLRISDNRCMCPAENPGCRD QPFTILYRDMDVV 356

MMI! hh I h::| = | III I M - 

Db 1087 ETCHNIQGSFRCL-RFECPPNYVQVSKTKC ERTTCHDFLECQNSPARITHYQLNFQ 1141

Qy 357 SGRSVPADIFQMQATTRYPGAYYIFQIKSGNEGREFYMRQTGPISATLVMTRPIKGPREI 416

■I III II-- M i III I I ■• : : M Ih 

Db 1142 TGLLVPAHIFRIGPAPAFTGDTIALNIIKGNEEGYFGTRRLNAYTGVVYLQRAVLEPRDF 1201

Qy 417 QLDLEM 422 

IMI 

Db 1202 ALDVEM 1207 



RESULT 14 
Q8IUI0 

ID Q8IUI0 PRELIMINARY; PRT; 1231 AA. 

AC Q8IUI0; 

DT 01-MAR-2003 (TrEMBLrel. 23, Created) 

DT 01-MAR-2003 (TrEMBLrel. 23, Last sequence update) 

DT 01-MAR-2003 (TrEMBLrel. 23, Last annotation update) 

DE Fibulin 2. 

GN FBLN2.

OS Homo sapiens (Human) . 

OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; 
OC Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. 



OX NCBI_TaxID=9606; 

RN [1] 

RP SEQUENCE FROM N.A. 

RA Li D., Marian A.J., Roberts R.;

RT "Identification of a novel alternatively spliced isoform of human

RT fibulin-2 gene abundantly expressed in heart and genetic evaluation in

RT patients with ARVD.";

RL (In) Unknown A. (eds.);

RL ABSTRACTS OF THE 52ND ANNUAL MEETING OF THE AMERICAN SOCIETY OF HUMAN

RL GENETICS, pp. 323-323, Unknown Publisher (2002).

DR EMBL; AY130459; AAN05436.1; -.

SQ SEQUENCE 1231 AA; 131790 MW; 95D69EB2082952A7 CRC64;



Query Match 28.0%; Score 709.5; DB 4; Length 1231; 

Best Local Similarity 34.3%; Pred. No. 1.6e-60; 

Matches 146; Conservative 61; Mismatches 156; Indels 63; Gaps 12; 

Qy 15 LCLPSPGN AQAQCTNGFDLDRQSGQCLDIDECRTIPEACRGDMMCVNQNGGYLCIP 70

II = h h H = 11 I 1 M' - III |=| III 

Db 827 LCQNTKGSFYCQARQRCMDGF-LQDPEGNCVDINECTSLSEPCRPGFSCINTVGSYTC-- 883 

Qy 71 RTNPVYRGPYSNPYSTPYSGPYPAAAPPLSAPNYPTISRPLICRFGYQ-MDESNQCVDVD 129 

III) I) 1= :||||: 
Db 884 QRNPLICARGYHASDDGAKCVDVN 907

Qy 130 ECATDSHQCNPTQICINTEGGYTCSCTDGYW--LLEGQCLDIDECRYG YCQQLCANV 184

II I hi hi I I I I I I hh = ll II I I 

Db 908 ECETGVHRCGEGQVCHNLPGSYRCDCKAGFQRDAFGRGCIDVNECWASPGRLCQHTCENT 967

Qy 185 PGSYSCTCNPGFTLNEDGRSCQDVNECATENPCVQTCVNTYGSFICRCDPGYELEEDGVH 244 

III hi II I Ih hlllll : II I I I I Ihl III 

Db 968 LGSYRCSCASGFLLAADGKRCEDVNECEAQR-CSQECANIYGSYQCYCRQGYQLAEDGHT 1026 

Qy 245 CSDMDECS-FSEFLCQHECVNQPGTYFCSCP-PGYILLDDNRSCQDINECEHRNHTCNLQ 302

1 = 1 = 111= : II hi Ihl hll II : : 1 I I = I = = I I I h 

Db 1027 CTDIDECAQGAGILCTFRCLNVPGSYQCACPEQGYTMTANGRSCKDVDECALGTHNCSEA 1086 

Qy 303 QTCYNLQGGFKCIDPIRCEEPYLRISDNRCMCPAENPGCRD QPFTILYRDMDVV 356

= 11 = 1 = 11 hh I l = = = l =1 III I I : : = 

Db 1087 ETCHNIQGSFRCL-RFECPPNYVQVSKTKC ERTTCHDFLECQNSPARITHYQLNFQ 1141

Qy 357 SGRSVPADIFQMQATTRYPGAYYIFQIKSGNEGREFYMRQTGPISATLVMTRPIKGPREI 416

= 1 III lh= = I I III I h = : : | : |h 

Db 1142 TGLLVPAHIFRIGPAPAFTGDTIALNIIKGNEEGYFGTRRLNAYTGVVYLQRAVLEPRDF 1201

Qy 417 QLDLEM 422 

Ihll 

Db 1202 ALDVEM 1207 



RESULT 15 
Q8NBH6 

ID Q8NBH6 PRELIMINARY; PRT; 638 AA. 

AC Q8NBH6; 

DT 01-OCT-2002 (TrEMBLrel. 22, Created)

DT 01-OCT-2002 (TrEMBLrel. 22, Last sequence update) 

DT 01-MAR-2003 (TrEMBLrel. 23, Last annotation update) 



DE Hypothetical protein NT2RP3003649.

OS Homo sapiens (Human).

OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;

OC Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. 

OX NCBI_TaxID=9606; 

RN [1]

RP SEQUENCE FROM N.A. 

RA Ota T., Nishikawa T., Suzuki Y., Kawai-Hio Y., Hayashi K., Ishii S.,

RA Saito K., Yamamoto J., Wakamatsu A., Nagai T., Nakamura Y.,

RA Nagahari K., Sugano S., Isogai T.;

RT "HRI human cDNA sequencing project.";

RL Submitted (MAR-2002) to the EMBL/GenBank/DDBJ databases.

DR EMBL; AK075566; BAC11705.1; -.

DR InterPro; IPR000020; Anaphylatoxin.

DR InterPro; IPR000152; Asx_hydroxyl.

DR InterPro; IPR001881; EGF_Ca.

DR InterPro; IPR006209; EGF_like. 

DR InterPro; IPR006210; IEGF. 



DR Pfam; PF01821; ANATO; 1. 

DR Pfam; PF00008; EGF; 4. 

DR SMART; SM00181; EGF; 9. 

DR SMART; SM00179; EGF_CA; 9. 

DR PROSITE; PS00010; ASX_HYDROXYL; 4.

DR PROSITE; PS01186; EGF_2; 3. 

DR PROSITE; PS01187; EGF_CA; 8. 

KW Hypothetical protein. 

SQ SEQUENCE 638 AA; 70577 MW; EBC0DE3147A7621F CRC64;

Query Match 27.5%; Score 696.5; DB 4; Length 638; 

Best Local Similarity 34.2%; Pred. No. 1.4e-59; 

Matches 161; Conservative 66; Mismatches 151; Indels 93; Gaps 22; 

Qy 16 CLPSPGNAQAQCTNGFDLDRQSGQCLDIDECRTIPEACRGDMMCVNQNGGYLCIPRTNPV 75

I ! : II :|| I I 1=11=11 = I I hi I I I 
Db 223 CRP KLQCKSGFIQD-ALGNCIDINECLSISAPCPIGHTCINTEGSYTC 269

Qy 76 YRGPYSNPYSTPYSGPYPAAAPPLSAPNYPTISRPLICRFGYQM-DESNQCVDVDECATD 134 

= 11 I II = =1 =11111111 

Db 270 QKNVPN CGRGYHLNEEGTRCVDVDECAPP 298

Qy 135 SHQCNPTQICINTEGGYTCSCTDGYWL--LEGQCLDIDEC-RY--GYCQQLCANVPGSYS 189

= I 1=1= 1=1111= = l=l==ll II I I I III 

Db 299 AEPCGKGHRCVNSPGSFRCECKTGYYFDGISRMCVDVNECQRYPGRLCGHKCENTLGSYL 358

Qy 190 CTCNPGFTLNEDGRSCQDVNECATENPCVQTCVNTYGSFICRCDPGYELEE-DGVHCSDM 248

hh II I'- 11111 = 1 = 111-.= =11 I I I tih i I 11 = 1 : III I 1 = 
Db 359 CSCSVGFRLSVDGRSCEDINECSS-SPCSQECANVYGSYQCYCRRGYQLSDVDGVTCEDI 417

Qy 249 DECSF--SEFLCQHECVNQPGTYFCSCP-PGYILLDDNRSCQDINECEHRNHTCNLQQTC 305

111= =1 : 1=1 |h: (III II I = | = l|ll = |l I |:: =11 

Db 418 DECALPTGGHICSYRCINIPGSFQCSCPSSGYRLAPNGRNCQDIDECVTGIHNCSINETC 477

Qy 306 YNLQGGFKCIDPIRCEEPYLRISDN RCMCPAENPGCRDQPFTILYRDMDVV 356

= l = lllhh 1111= 11= II | =: : = 

Db 478 FNIQGGFRCL-AFECPENYRRSAATLQQEKTDTVRCIKSCRPNDVTCVFDPVHTI 531



Qy 357 SGRSV- PADIFQMQA-TTRYPG--AYYIFQIKSGNEGREFYMRQTG 398



i = 10-11=1111111! ])) 

Db 532 SHTVISLPTFREFTRPEEIIFLRAITPPHPASQANIIFDITEGNLRDSFDIIKRYM--DG 589

Qy 399 PISATLVMTRPIKGPREIQLDLEM-ITVNTVINFRGSSVIRLRIYVSQYPF 448

: III li I III I |:: I =h = hlhl I 

Db 590 MTVGVVRQVRPIVGPFHAVLKLEMNYVVGGVVSHR--NVVNVHIFVSEYWF 638



Search completed: January 9, 2004, 12:36:56 
Job time : 36.4615 secs



